Sir:



PATENT APPLICATION TRANSMITTAL LETTER 

Transmitted herewith for filing, please find 

A Utility Patent Application under 37 C.F.R. 1.53(b). 
It is a continuing application, as follows: 

□ continuation □ divisional □ continuation-in-part of prior application number

/ . 



□
A Provisional Patent Application under 37 C.F.R. 1.53(c).
A Design Patent Application (submitted in duplicate).
□ Provisional Application Cover Sheet



New or Revised Specification, including pages _J_ to _$0_ containing: 



Specification 
Claims 
Abstract 



□ Substitute Specification, including Claims and Abstract

□ The present application is a continuation application of Application
No. ___ filed ___. The present application includes the
Specification of the parent application which has been revised in
accordance with the amendments filed in the parent application. Since
none of those amendments incorporate new matter into the parent
application, the present revised Specification also does not include
new matter.



□ The present application is a continuation application of Application
No. ___ filed ___, which in turn is a continuation-in-part of
Application No. ___ filed ___. The present application includes the
Specification of the parent application which has been
revised in accordance with the amendments filed in the parent
application. Although the amendments in the parent C-I-P application
may have incorporated new matter, since those are the only revisions
included in the Specification of the present application, the application includes no
new matter in relation to the parent application.

A copy of earlier application Serial No. ___ Filed ___,
including Specification, Claims and Abstract (pages 1 - ___), to which no new
matter has been added TOGETHER WITH
A Preliminary Amendment.
□ Petition to Accept Photographic Drawings.
An □ Executed □ Unexecuted Declaration or Oath and Power of Attorney.
□ An Associate Power of Attorney.
□ An □ Executed □ Copy of Executed Assignment of the Invention to ___

□ A Recordation Form Cover Sheet.

□ Recordation Fee - $40.00. 

□ The prior application is assigned of record to



□ Priority is claimed under 35 U.S.C. § 119 of Patent Application No.



filed in . (country). 

□ A Certified Copy of each of the above applications for which priority is 
claimed: 

□ is enclosed. 

□ has been filed in prior application Serial No. filed 



□ An □ Executed or □ Copy of Executed Earlier Statement Claiming Small Entity
Status under 37 C.F.R. 1.9 and 1.27
□ is enclosed.

□ has been filed in prior application Serial No. ___ filed ___;

said status is still proper and desired in present case.
Diskette Containing DNA/Amino Acid Sequence Information.
Statement to Support Submission of DNA/Amino Acid Sequence Information.



□ The computer readable form in this application is identical with that filed in
Application Serial Number ___, filed ___. In accordance with 37 CFR
1.821(e), please use the □ first-filed, □ last-filed or □ only computer readable
form filed in that application as the computer readable form for the instant
application. It is understood that the Patent and Trademark Office will make the
necessary change in application number and filing date for the computer readable
form that will be used for the instant application. A paper copy of the Sequence
Listing is included in the originally-filed specification of the instant application,
□ included in a separately filed preliminary amendment for incorporation into the
specification.

□ Information Disclosure Statement

□ Attached Form 1449.

Copies of each of the references listed on the attached Form PTO-1449 are
enclosed herewith.
□ A copy of Petition for Extension of Time as filed in the
prior case.

□ Appended Material as follows: 
Return Receipt Postcard (should be specifically itemized).
Other as follows:

Cancel in this application original claims ___ of the prior application before
calculating the filing fee. (At least one original independent claim must be retained
for filing purposes.)
A Check is enclosed in the amount of $2150.00



The Commissioner is authorized to charge payment of the following fees and to 
refund any overpayment associated with this communication or during the pendency 
of this application to deposit account 23-3050. This sheet is provided in duplicate. 



□ The foregoing amount due.

☒ Any additional filing fees required, including fees for the presentation of extra
claims under 37 C.F.R. 1.16.

☒ Any additional patent application processing fees under 37 C.F.R. 1.17 or
1.20(d).

□ The issue fee set in 37 C.F.R. 1.18 at the mailing of the Notice of Allowance.
The Commissioner is hereby requested to grant an extension of time for the
appropriate length of time, should one be necessary, in connection with this filing or
any future filing submitted to the U.S. Patent and Trademark Office in the above-
identified application during the pendency of this application. The Commissioner is
further authorized to charge any fees related to any such extension of time to deposit
account 23-3050. This sheet is provided in duplicate.

SHOULD ANY DEFICIENCIES APPEAR with respect to this application, including 
deficiencies in payment of fees, missing parts of the application or otherwise, the United 
States Patent and Trademark Office is respectfully requested to promptly notify the 
undersigned. 

Date: [illegible]

Mark J. Rosen
Registration No. 39,822
Woodcock Washburn Kurtz
Mackiewicz & Norris LLP
One Liberty Place - 46th Floor
Philadelphia PA 19103
Telephone: (215) 568-3100
Facsimile: (215) 568-3439

© 1997 WWKMN



TANKYRASE HOMOLOG PROTEIN (THP), NUCLEIC ACIDS, AND 
METHODS RELATED TO THE SAME 

FIELD OF THE INVENTION 

The present invention is directed, in part, to nucleic acid molecules
encoding tankyrase homolog protein (THP), novel THP polypeptides, and assays for
screening compounds which bind to THP and/or modulate the activity of THP.

BACKGROUND OF THE INVENTION

Telomeres, the physical ends of eukaryotic chromosomes, consist of short
repeated nucleotide sequences that are characteristic for all telomeric DNA, from yeast to
human (see, e.g., Harley et al., Curr. Opin. Genet. Dev., 1995, 5, 249). Telomeres are
essential for the integrity of chromosomes (see, e.g., Lundblad et al., Cell, 1996, 87, 369),
since DNA polymerases are incapable of replicating the very end of DNA. This
replication defect affects the telomeres, which are progressively shortened with each cell
cycle, and as they reach a critical short length, cells will die (see, e.g., Goldstein, Science,
1990, 249, 1129). In some cells, telomere length is preserved by a ribonucleoprotein
reverse transcriptase, referred to as "telomerase," which adds the characteristic telomere
repeats (Greider et al., Cell, 1985, 43, 405). Telomerase is not expressed in most somatic
cells and, as a consequence, these cells have a finite life-span. Extensive studies in a broad
range of tumors indicate that 85-95% of all tumor cells express telomerase (see, e.g.,
Shay et al., Eur. J. Cancer, 1997, 33, 787). Telomerase expression is an early event in
tumor progression and is often present at the premalignant stage. Telomerase expression
alone is not sufficient for tumorigenesis, but it is likely to be an essential permissive event
to allow the tumor to continue to grow.

In addition to telomerase, another protein has been identified as capable of
regulating telomere length, referred to as telomere repeat binding factor 1 (TRF1), that
binds as a dimer to double-stranded telomeric repeats (Chong et al., Science, 1995, 270,
1663). Overexpression of TRF1 in a telomerase-expressing cell line leads to progressive
telomere shortening, whereas inhibition of TRF1 increases telomere length (van Steensel
et al., Nature, 1997, 385, 740). TRF1 does not control the expression of telomerase itself
but, rather, is thought to act by inhibiting telomerase action at the telomere termini (Smith
et al., Science, 1998, 282, 1484).

The ability of TRF1 to bind to telomeric DNA can be inhibited by its
modification through the action of an enzyme called tankyrase, which covalently modifies
TRF1 by ADP-ribosylation (Smith et al., Science, 1998, 282, 1484). Tankyrase is a
multidomain protein characterized by the presence of four distinct domains, listed
here in order from the polypeptide NH2-terminus to the carboxy-terminus: HPS, ANK,
SAM, and PARP. The HPS domain is a region containing homopolymeric runs of His,
Pro, and Ser and has unknown function. The ANK domain contains 24 repeat units of a
33-amino acid ANK motif, as found in ankyrins. This domain is responsible for the
binding of tankyrase to TRF1. The SAM domain (sterile alpha module) has unknown
function but is believed to contribute to protein-protein interactions. The C-terminal
catalytic active region is homologous to the active region of human poly(ADP-ribose)
polymerase (PARP) enzymes. PARP catalyzes the formation of poly(ADP-ribose) onto a
suitable protein receptor using NAD+ as substrate. The present invention involves the
surprising discovery of a novel polypeptide, herein designated THP, that exhibits at least
one functional homology to human tankyrase, and its role as a key component, for
example, in regulating telomerase activity and thereby telomere length. THP is, thus,
useful in the search for novel anti-tumor agents that can modify and/or control telomere
length and/or telomere maintenance. These and other aspects of the invention are
described below.

SUMMARY OF THE INVENTION

The present invention is directed to, in part, isolated nucleic acid molecules
comprising SEQ ID NO:3, or a fragment thereof; SEQ ID NO:4, or a fragment thereof; a
nucleotide sequence complementary to at least a portion of SEQ ID NO:3 or SEQ ID
NO:4; a nucleotide sequence homologous to SEQ ID NO:3 or SEQ ID NO:4, or a
fragment thereof; a nucleotide sequence that encodes a polypeptide comprising SEQ ID
NO:5, or a fragment thereof; or a nucleotide sequence that encodes a polypeptide
comprising an amino acid sequence homologous to SEQ ID NO:5, or a fragment thereof.

The present invention is also directed to recombinant expression vectors
comprising any of the nucleic acid molecules described above.

The present invention is also directed to host cells transformed with a
recombinant expression vector comprising any of the nucleic acid molecules described
above.

The present invention is also directed to methods of producing a
polypeptide comprising SEQ ID NO:5, or a homolog or fragment thereof, by introducing a
recombinant expression vector comprising any of the nucleic acid molecules described
above into a compatible host cell, growing the host cell under conditions for expression of
the polypeptide, and recovering the polypeptide from the host cell.

The present invention is also directed to compositions comprising any of
the nucleic acid molecules described above and an acceptable carrier or diluent.

The present invention is also directed to isolated polypeptides encoded by 
any of the nucleic acid molecules described above. 

The present invention is also directed to compositions comprising a 
polypeptide encoded by any of the nucleic acid molecules described above and an 
acceptable carrier or diluent. 

The present invention is also directed to isolated antibodies which bind to 
an epitope on a polypeptide encoded by any of the nucleic acid molecules described above. 

The present invention is also directed to kits comprising antibodies which
bind to a polypeptide encoded by any of the nucleic acid molecules described above and a
negative control antibody.

The present invention is also directed to methods of inducing an immune
response in a mammal against a polypeptide encoded by any of the nucleic acid molecules
described above by administering to the mammal an amount of the polypeptide sufficient
to induce the immune response.

The present invention is also directed to methods of identifying a
compound which binds to tankyrase homolog protein (THP) by contacting THP with a
compound, and determining whether the compound binds THP.

The present invention is also directed to methods of identifying a 
compound which binds a nucleic acid molecule encoding tankyrase homolog protein 
(THP) by contacting THP with a compound, and determining whether the compound binds 
the nucleic acid molecule. 

The present invention is also directed to methods of identifying a 
compound which modulates the activity of tankyrase homolog protein (THP) by 
contacting THP with a compound, and determining whether THP activity is modified. 

The present invention is also directed to compounds which modulate THP
activity identified by contacting THP with the compound, and determining whether the
compound modifies activity of THP, binds to THP, or binds to a nucleic acid molecule
encoding THP.

These and other aspects of the invention are described in greater detail
below.

DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS 

The present invention provides, inter alia, isolated and purified
polynucleotides that encode THP or a portion thereof, vectors containing these
polynucleotides, host cells transformed with these vectors, processes of making THP,
methods of using the above polynucleotides and vectors, isolated and purified THP, and
methods of screening compounds which modulate THP activity.

Various definitions are made throughout this document. Most words have
the meaning that would be attributed to those words by one skilled in the art. Words
specifically defined either below or elsewhere in this document have the meaning provided
in the context of the present invention as a whole and as are typically understood by those
skilled in the art.

As used herein, the term "activity" refers to a variety of measurable indicia
suggesting or revealing binding, either direct or indirect; affecting a response, i.e., having a
measurable effect in response to some exposure or stimulus, including, for example, the
affinity of a compound for directly binding a polypeptide or polynucleotide of the
invention, or, for example, measurement of amounts of upstream or downstream proteins
or other similar functions after some stimulus or event.

As used herein, the abbreviation in lower case (Thp) refers to a gene,
cDNA, RNA or nucleic acid sequence while the upper case version (THP) refers to a
protein, polypeptide, peptide, oligopeptide, or amino acid sequence.

As used herein, the term "antibody" is meant to refer to complete, intact
antibodies, and Fab fragments and F(ab')2 fragments thereof. Complete, intact antibodies
include monoclonal antibodies such as murine monoclonal antibodies, chimeric antibodies
and humanized antibodies.

As used herein, the term "binding" means the physical or chemical
interaction between two proteins or compounds or associated proteins or compounds or
combinations thereof. Binding includes ionic, non-ionic, hydrogen bonds, van der Waals,
hydrophobic interactions, etc. The physical interaction, the binding, can be either direct or
indirect, indirect being through or due to the effects of another protein or compound.
Direct binding refers to interactions that do not take place through or due to the effect of
another protein or compound but instead are without other substantial chemical
intermediates.

As used herein, the term "compound" means any identifiable chemical or
molecule, including, but not limited to, small molecule, peptide, protein, sugar, nucleotide,
or nucleic acid, and such compound can be natural or synthetic.

As used herein, the term "complementary" refers to Watson-Crick
basepairing between nucleotide units of a nucleic acid molecule.
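
By way of illustration only, Watson-Crick complementarity between two DNA strands can be sketched as follows. The function names and the restriction to the four unmodified DNA bases are assumptions made for this example and are not part of the disclosure.

```python
# Watson-Crick pairing for DNA (an RNA strand would pair A with U instead of T).
DNA_PAIRS = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_complement(seq: str) -> str:
    """Return the complementary strand, written in the 5' to 3' direction."""
    return "".join(DNA_PAIRS[base] for base in reversed(seq.upper()))


def is_complementary(strand_a: str, strand_b: str) -> bool:
    """True when strand_b base-pairs with every position of strand_a."""
    return reverse_complement(strand_a) == strand_b.upper()


if __name__ == "__main__":
    print(reverse_complement("ATGC"))        # GCAT
    print(is_complementary("ATGC", "GCAT"))  # True
```

Both strands are written 5' to 3', so the complement is reversed before comparison.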

As used herein, the term "contacting" means bringing together, either
directly or indirectly, a compound into physical proximity to a polypeptide or
polynucleotide of the invention. The polypeptide or polynucleotide can be in any number
of buffers, salts, solutions, etc. Contacting includes, for example, placing the compound
into a beaker, microtiter plate, cell culture flask, or a microarray, such as a gene chip, or
the like, which contains the nucleic acid molecule or polypeptide encoding THP or a
fragment thereof.

As used herein, the phrase "homologous nucleotide sequence," or
"homologous amino acid sequence," or variations thereof, refers to sequences
characterised by a homology, at the nucleotide level or amino acid level, of at least about
60%, more preferably at least about 70%, more preferably at least about 80%, more
preferably at least about 90%, and most preferably at least about 95% to the entire SEQ ID
NO:3 or SEQ ID NO:4, or to at least a portion of SEQ ID NO:3 or SEQ ID NO:4 which
encodes a functional domain of the encoded polypeptide, or to SEQ ID NO:5.
Homologous nucleotide sequences include those sequences coding for isoforms of THP
proteins. Such isoforms can be expressed in different tissues of the same organism as a
result of, for example, alternative splicing of RNA. Alternatively, isoforms can be
encoded by different genes. Homologous nucleotide sequences include nucleotide
sequences encoding for a THP protein of species other than humans, including, but not
limited to, mammals. Homologous nucleotide sequences also include, but are not limited
to, naturally occurring allelic variations and mutations of the nucleotide sequences set
forth herein. A homologous nucleotide sequence does not, however, include the
nucleotide sequence encoding human tankyrase. Homologous amino acid sequences
include those amino acid sequences which encode conservative amino acid substitutions in
SEQ ID NO:5, as well as polypeptides having tankyrase activity. A homologous amino
acid sequence does not, however, include the amino acid sequence encoding human
tankyrase. Percent homology is preferably determined by, for example, the Gap program
(Wisconsin Sequence Analysis Package, Version 8 for Unix, Genetics Computer Group,
University Research Park, Madison WI), using the default settings, which uses the
algorithm of Smith and Waterman (Adv. Appl. Math., 1981, 2, 482-489, which is
incorporated herein by reference in its entirety).
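
For illustration, once two sequences have been aligned by some alignment program, a percent figure can be compared against the thresholds recited above. The following is a minimal sketch that assumes the alignment is already available as two equal-length gapped strings. It does not reproduce the Gap program or its default settings, and the function names and the choice of denominator (all alignment columns) are assumptions made for this example.

```python
# "At least about" thresholds recited in the definition above, in percent.
HOMOLOGY_TIERS = (60, 70, 80, 90, 95)


def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percentage of alignment columns in which both sequences carry the same residue."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    # Ignore columns that are gaps in both sequences.
    columns = [
        (a, b)
        for a, b in zip(aligned_a.upper(), aligned_b.upper())
        if not (a == gap and b == gap)
    ]
    if not columns:
        return 0.0
    matches = sum(1 for a, b in columns if a == b and a != gap)
    return 100.0 * matches / len(columns)


def highest_tier_met(identity: float):
    """Return the highest recited threshold that the value reaches, or None."""
    met = [tier for tier in HOMOLOGY_TIERS if identity >= tier]
    return max(met) if met else None


if __name__ == "__main__":
    value = percent_identity("ACGT-A", "ACCTTA")  # 4 identical columns out of 6
    print(round(value, 1), highest_tier_met(value))
```

Different programs count gapped columns differently, so a value computed this way need not match the figure reported by any particular package.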

As used herein, the term "isolated" nucleic acid molecule refers to a nucleic
acid molecule (DNA or RNA) that has been removed from its native environment.
Examples of isolated nucleic acid molecules include, but are not limited to, recombinant
DNA molecules contained in a vector, recombinant DNA molecules maintained in a
heterologous host cell, partially or substantially purified nucleic acid molecules, and
synthetic DNA or RNA molecules.

As used herein, the terms "modulates" or "modifies" mean an increase or
decrease in the amount, quality, or effect of a particular activity or protein.

As used herein, the term "oligonucleotide" refers to a series of linked
nucleotide residues which has a sufficient number of bases to be used in a polymerase
chain reaction (PCR). This short sequence is based on (or designed from) a genomic or
cDNA sequence and is used to amplify, confirm, or reveal the presence of an identical,
similar or complementary DNA or RNA in a particular cell or tissue. Oligonucleotides
comprise portions of a DNA sequence having at least about 10 nucleotides and as many as
about 50 nucleotides, preferably about 15 to 30 nucleotides. They are chemically
synthesized and may be used as probes.

As used herein, the term "probe" refers to nucleic acid sequences of
variable length, preferably between at least about 10 and as many as about 6,000
nucleotides, depending on use. They are used in the detection of identical, similar, or
complementary nucleic acid sequences. Longer length probes are usually obtained from a
natural or recombinant source, are highly specific and much slower to hybridize than
oligomers. They may be single- or double-stranded and carefully designed to have
specificity in PCR, hybridization membrane-based, or ELISA-like technologies.

As used herein, the phrase "stringent hybridization conditions" or "stringent
conditions" refers to conditions under which a probe, primer, or oligonucleotide will
hybridize to its target sequence, but to no other sequences. Stringent conditions are
sequence-dependent and will be different in different circumstances. Longer sequences
hybridize specifically at higher temperatures. Generally, stringent conditions are selected
to be about 5°C lower than the thermal melting point (Tm) for the specific sequence at a
defined ionic strength and pH. The Tm is the temperature (under defined ionic strength,
pH and nucleic acid concentration) at which 50% of the probes complementary to the
target sequence hybridize to the target sequence at equilibrium. Since the target sequences
are generally present in excess, at Tm, 50% of the probes are occupied at equilibrium.
Typically, stringent conditions will be those in which the salt concentration is less than
about 1.0 M sodium ion, typically about 0.01 to 1.0 M sodium ion (or other salts) at pH 7.0
to 8.3 and the temperature is at least about 30°C for short probes, primers or
oligonucleotides (e.g., 10 to 50 nucleotides) and at least about 60°C for longer probes,
primers or oligonucleotides. Stringent conditions may also be achieved with the addition
of destabilising agents, such as formamide.
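
As a rough illustration of the relationship between Tm and a stringent temperature about 5°C below it, the sketch below estimates Tm for a short oligonucleotide with the Wallace rule (2°C per A or T, 4°C per G or C). The Wallace rule is a common rule of thumb for short oligonucleotides and is an assumption of this example; it is not a method stated in this document, and it ignores the ionic strength, pH and concentration dependence noted above.

```python
def wallace_tm(oligo: str) -> float:
    """Estimate Tm in degrees C for a short DNA oligonucleotide (Wallace rule)."""
    seq = oligo.upper()
    at_count = seq.count("A") + seq.count("T")
    gc_count = seq.count("G") + seq.count("C")
    if at_count + gc_count != len(seq):
        raise ValueError("sequence must contain only A, C, G and T")
    return 2.0 * at_count + 4.0 * gc_count


def stringent_temperature(oligo: str, offset_c: float = 5.0) -> float:
    """Temperature about `offset_c` degrees below the estimated Tm."""
    return wallace_tm(oligo) - offset_c


if __name__ == "__main__":
    probe = "ATGCATGCATGCATGC"           # hypothetical 16-mer, 8 A/T and 8 G/C
    print(wallace_tm(probe))             # 48.0
    print(stringent_temperature(probe))  # 43.0
```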

The amino acid sequences are presented in the amino to carboxy direction,
from left to right. The amino and carboxy groups are not presented in the sequence. The
nucleotide sequences are presented by single strand only, in the 5' to 3' direction, from left
to right. Nucleotides and amino acids are represented in the manner recommended by the
IUPAC-IUB Biochemical Nomenclature Commission, or (for amino acids) by three-letter
code.

One aspect of the present invention is directed to nucleic acid molecules
comprising novel nucleotide sequences encoding THP. The nucleic acid molecules are
preferably either RNA or DNA, but may contain both RNA and DNA monomers or
peptide nucleic acid monomers. The nucleic acid molecule may be single stranded or
double stranded. The monomers of the nucleic acid molecules may be linked via
conventional phosphodiester bonds or modified bonds, such as, for example,
phosphorothioate bonds and the like. In addition, the sugar moieties of the monomers may
be modified by, for example, addition of 2' substitutions which help confer nuclease
resistance and/or cellular uptake.

In a preferred embodiment of the invention, the nucleic acid molecule
comprises SEQ ID NO:3, which is 4512 bases in length and comprises an open reading
frame (ORF) of approximately 3498 nucleotides (from about position 23 to about position
3520 within SEQ ID NO:3), which encodes human THP. Alternatively, the nucleic acid
molecule comprises a fragment of SEQ ID NO:3. Preferably, the fragment comprises
from about 10 to about 100 nucleotides, from about 101 to about 200 nucleotides, from
about 201 to about 300 nucleotides, from about 301 to about 400 nucleotides, from about
401 to about 500 nucleotides, from about 501 to about 600 nucleotides, from about 601 to
about 700 nucleotides, from about 701 to about 800 nucleotides, from about 801 to about
900 nucleotides, from about 901 to about 1000 nucleotides, from about 1001 to about 1100
nucleotides, from about 1101 to about 1200 nucleotides, from about 1201 to about 1300
nucleotides, from about 1301 to about 1400 nucleotides, from about 1401 to about 1500
nucleotides, from about 1501 to about 1600 nucleotides, from about 1601 to about 1700
nucleotides, from about 1701 to about 1800 nucleotides, from about 1801 to about 1900
nucleotides, from about 1901 to about 2000 nucleotides, from about 2001 to about 2100
nucleotides, from about 2101 to about 2200 nucleotides, from about 2201 to about 2300
nucleotides, from about 2301 to about 2400 nucleotides, from about 2401 to about 2500
nucleotides, from about 2501 to about 2600 nucleotides, from about 2601 to about 2700
nucleotides, from about 2701 to about 2800 nucleotides, from about 2801 to about 2900
nucleotides, from about 2901 to about 3000 nucleotides, from about 3001 to about 3100
nucleotides, from about 3101 to about 3200 nucleotides, from about 3201 to about 3300
nucleotides, from about 3301 to about 3400 nucleotides, from about 3401 to about 3500
nucleotides, from about 3501 to about 3600 nucleotides, from about 3601 to about 3700
nucleotides, from about 3701 to about 3800 nucleotides, from about 3801 to about 3900
nucleotides, from about 3901 to about 4000 nucleotides, from about 4001 to about 4100
nucleotides, from about 4101 to about 4200 nucleotides, from about 4201 to about 4300
nucleotides, from about 4301 to about 4400 nucleotides, from about 4401 to about 4500
nucleotides, or from about 4501 to about 4511 nucleotides, and any combinations thereof.
The fragment can be located within any portion of SEQ ID NO:3.
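
The arithmetic behind the stated coordinates can be checked with a short sketch: an ORF running from position 23 to position 3520 (1-based, inclusive) spans 3520 - 23 + 1 = 3498 nucleotides, and the recited fragment lengths run in steps of 100 up to 4511. The helper names below are hypothetical, and the placeholder sequence of "N" characters stands in for SEQ ID NO:3, which is not reproduced here.

```python
SEQ3_LENGTH = 4512             # stated length of SEQ ID NO:3
ORF_START, ORF_END = 23, 3520  # stated ORF positions, 1-based and inclusive


def orf_length(start: int, end: int) -> int:
    """Number of nucleotides in an inclusive 1-based interval."""
    return end - start + 1


def extract_orf(seq: str, start: int = ORF_START, end: int = ORF_END) -> str:
    """Slice the ORF out of a sequence using 1-based inclusive coordinates."""
    return seq[start - 1:end]


def length_bins(max_len: int, first_low: int = 10, width: int = 100):
    """Fragment-length ranges as recited: 10-100, 101-200, ..., up to max_len."""
    bins = [(first_low, width)]
    low = width + 1
    while low <= max_len:
        bins.append((low, min(low + width - 1, max_len)))
        low += width
    return bins


if __name__ == "__main__":
    placeholder = "N" * SEQ3_LENGTH               # stand-in, not the real sequence
    print(orf_length(ORF_START, ORF_END))         # 3498
    print(len(extract_orf(placeholder)))          # 3498
    print(length_bins(SEQ3_LENGTH - 1)[-1])       # (4501, 4511)
```

The ORF length of 3498 is a multiple of three, consistent with a whole number of codons.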

In another preferred embodiment of the invention, the nucleic acid
molecule comprises SEQ ID NO:4, which is 3498 bases in length and comprises the ORF
(from about position 23 to about position 3520 within SEQ ID NO:3) described above.
Alternatively, the nucleic acid molecule comprises a fragment of SEQ ID NO:4.
Preferably, the fragment comprises from about 10 to about 100 nucleotides, from about
101 to about 200 nucleotides, from about 201 to about 300 nucleotides, from about 301 to
about 400 nucleotides, from about 401 to about 500 nucleotides, from about 501 to about
600 nucleotides, from about 601 to about 700 nucleotides, from about 701 to about 800
nucleotides, from about 801 to about 900 nucleotides, from about 901 to about 1000
nucleotides, from about 1001 to about 1100 nucleotides, from about 1101 to about 1200
nucleotides, from about 1201 to about 1300 nucleotides, from about 1301 to about 1400
nucleotides, from about 1401 to about 1500 nucleotides, from about 1501 to about 1600
nucleotides, from about 1601 to about 1700 nucleotides, from about 1701 to about 1800
nucleotides, from about 1801 to about 1900 nucleotides, from about 1901 to about 2000
nucleotides, from about 2001 to about 2100 nucleotides, from about 2101 to about 2200
nucleotides, from about 2201 to about 2300 nucleotides, from about 2301 to about 2400
nucleotides, from about 2401 to about 2500 nucleotides, from about 2501 to about 2600
nucleotides, from about 2601 to about 2700 nucleotides, from about 2701 to about 2800
nucleotides, from about 2801 to about 2900 nucleotides, from about 2901 to about 3000
nucleotides, from about 3001 to about 3100 nucleotides, from about 3101 to about 3200
nucleotides, from about 3201 to about 3300 nucleotides, from about 3301 to about 3400
nucleotides, or from about 3401 to about 3497 nucleotides, and any combinations thereof.
The fragment can be located within any portion of SEQ ID NO:4.

In another preferred embodiment of the invention, the nucleic acid
molecule comprises a nucleotide sequence complementary to at least a portion of SEQ ID
NO:3 or SEQ ID NO:4. Preferably, the nucleic acid molecule comprises a nucleotide
sequence complementary to the entire sequence recited in SEQ ID NO:3 or SEQ ID NO:4.
Alternatively, the nucleic acid molecule comprises a nucleotide sequence complementary
to a portion of SEQ ID NO:3 or SEQ ID NO:4 (i.e., complementary to any of the
fragments described above). Nucleotide sequences complementary to at least a portion of
SEQ ID NO:3 or SEQ ID NO:4 include, for example, oligonucleotides which hybridize
under stringent hybridization conditions to at least a portion of SEQ ID NO:3 or SEQ ID
NO:4. Preferred oligonucleotides comprise at least about 10 nucleotides and as many as
about 50 nucleotides, preferably about 15 to 30 nucleotides. They are chemically
synthesized and may be used as probes.

In another preferred embodiment of the invention, the nucleic acid
molecule comprises a nucleotide sequence homologous to SEQ ID NO:3 or SEQ ID NO:4.
Preferably, the nucleotide sequence is at least about 75% homologous, more preferably at
least about 80% homologous, more preferably at least about 90% homologous, and most
preferably at least about 95% homologous to the entire SEQ ID NO:3 or SEQ ID NO:4.
Alternatively, the nucleotide sequence is at least about 60% homologous, more preferably
at least about 70% homologous, more preferably at least about 80% homologous, more
preferably at least about 90% homologous, and most preferably at least about 95%
homologous to a portion of SEQ ID NO:3 or SEQ ID NO:4 which encodes a functional
domain of the polypeptide encoded thereby. In addition, a nucleotide sequence
homologous to SEQ ID NO:3 or SEQ ID NO:4 also includes a fragment of the nucleotide
sequence homologous to SEQ ID NO:3 or SEQ ID NO:4 of the lengths described above.

In another preferred embodiment of the invention, the nucleic acid
molecule comprises a nucleotide sequence that encodes a polypeptide comprising SEQ ID
NO:5. The nucleic acid molecule preferably comprises SEQ ID NO:[illegible] or comprises SEQ
ID NO:4 containing codon [text illegible in source].
As is well known in the art, because of the degeneracy of the genetic code, there are
numerous other DNA and RNA molecules that can code for the same polypeptide as that
encoded by SEQ ID NO:4. The present invention, therefore, contemplates these other
DNA and RNA molecules which, on expression, encode the polypeptide of SEQ ID NO:5.
DNA and RNA molecules other than those specifically disclosed herein, characterised
simply by a change in a codon for a particular amino acid, are within the scope of the
present invention.

As is well known in the art, because of the degeneracy of the genetic code,
there are numerous other DNA and RNA molecules that can code for the same polypeptide
as that encoded by the aforementioned THP gene. The present invention, therefore,
contemplates those other DNA and RNA molecules which, on expression, encode the
polypeptide of SEQ ID NO:4. Having identified the amino acid residue sequence encoded
by a Thp gene, and with knowledge of all triplet codons for each particular amino acid
residue, it is possible to describe all such encoding RNA and DNA sequences. DNA and
RNA molecules other than those specifically disclosed herein, characterised simply by a
change in a codon for a particular amino acid, are within the scope of this invention.

A table of amino acids and their representative abbreviations, symbols and 
codons is set forth below in the following Table 1. 

Table 1 

Amino acid      Abbrev.   Symbol   Codon(s) 
Alanine         Ala       A        GCA GCC GCG GCU 
Cysteine        Cys       C        UGC UGU 
Aspartic acid   Asp       D        GAC GAU 
Glutamic acid   Glu       E        GAA GAG 
Phenylalanine   Phe       F        UUC UUU 
Glycine         Gly       G        GGA GGC GGG GGU 
Histidine       His       H        CAC CAU 
Isoleucine      Ile       I        AUA AUC AUU 
Lysine          Lys       K        AAA AAG 
Leucine         Leu       L        UUA UUG CUA CUC CUG CUU 
Methionine      Met       M        AUG 
Asparagine      Asn       N        AAC AAU 
Proline         Pro       P        CCA CCC CCG CCU 
Glutamine       Gln       Q        CAA CAG 
Arginine        Arg       R        AGA AGG CGA CGC CGG CGU 
Serine          Ser       S        AGC AGU UCA UCC UCG UCU 
Threonine       Thr       T        ACA ACC ACG ACU 
Valine          Val       V        GUA GUC GUG GUU 
Tryptophan      Trp       W        UGG 
Tyrosine        Tyr       Y        UAC UAU 


As is well known in the art, codons constitute triplet sequences of 
nucleotides in mRNA molecules and, as such, are characterised by the base uracil (U) in 
place of the base thymine (T) (which is present in DNA molecules). A simple change in a 
codon for the same amino acid residue within a polynucleotide will not change the 
sequence or structure of the encoded polypeptide. 
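
The degeneracy described above can be illustrated with a short Python sketch. It is not part of the disclosed sequences: the dictionary below simply restates the standard codon assignments for the twenty amino acids, and the function names `count_encodings`, `encodings` and `translate` are hypothetical helpers introduced only for this illustration.

```python
from itertools import product

# Standard codon assignments (mRNA alphabet), keyed by one-letter symbol.
CODONS = {
    "A": ["GCA", "GCC", "GCG", "GCU"],
    "C": ["UGC", "UGU"],
    "D": ["GAC", "GAU"],
    "E": ["GAA", "GAG"],
    "F": ["UUC", "UUU"],
    "G": ["GGA", "GGC", "GGG", "GGU"],
    "H": ["CAC", "CAU"],
    "I": ["AUA", "AUC", "AUU"],
    "K": ["AAA", "AAG"],
    "L": ["UUA", "UUG", "CUA", "CUC", "CUG", "CUU"],
    "M": ["AUG"],
    "N": ["AAC", "AAU"],
    "P": ["CCA", "CCC", "CCG", "CCU"],
    "Q": ["CAA", "CAG"],
    "R": ["AGA", "AGG", "CGA", "CGC", "CGG", "CGU"],
    "S": ["AGC", "AGU", "UCA", "UCC", "UCG", "UCU"],
    "T": ["ACA", "ACC", "ACG", "ACU"],
    "V": ["GUA", "GUC", "GUG", "GUU"],
    "W": ["UGG"],
    "Y": ["UAC", "UAU"],
}

# Reverse lookup: codon -> one-letter amino acid symbol.
CODON_TO_AA = {codon: aa for aa, codons in CODONS.items() for codon in codons}


def count_encodings(peptide):
    """Number of distinct mRNA sequences that encode the given peptide."""
    total = 1
    for aa in peptide:
        total *= len(CODONS[aa])
    return total


def encodings(peptide):
    """Yield every mRNA sequence that encodes the given peptide."""
    for combo in product(*(CODONS[aa] for aa in peptide)):
        yield "".join(combo)


def translate(mrna):
    """Translate an mRNA string (length a multiple of 3, no stop codons)."""
    return "".join(CODON_TO_AA[mrna[i:i + 3]] for i in range(0, len(mrna), 3))
```

For a peptide of any realistic length the count grows very quickly, which is why the text describes the encoding sequences by reference to the codon table rather than listing them individually.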

Alternatively, the nucleic acid molecule comprises a nucleotide sequence 
that encodes a fragment of the polypeptide encoding SEQ ID NO:5. Preferably, the 
fragment comprises from about 5 to about 20 amino acids, from about 21 to about 40 
amino acids, from about 41 to about 60 amino acids, from about 61 to about 80 amino 
acids, from about 81 to about 100 amino acids, from about 101 to about 120 amino acids, 
from about 121 to about 140 amino acids, from about 141 to about 160 amino acids, from 
about 161 to about 180 amino acids, from about 181 to about 200 amino acids, from about 
201 to about 220 amino acids, from about 221 to about 240 amino acids, from about 241 to 
about 260 amino acids, from about 261 to about 280 amino acids, from about 281 to about 
300 amino acids, from about 301 to about 320 amino acids, from about 321 to about 340 
amino acids, from about 341 to about 360 amino acids, from about 361 to about 380 amino 
acids, from about 381 to about 400 amino acids, from about 401 to about 420 amino acids, 
from about 421 to about 440 amino acids, from about 441 to about 460 amino acids, from 
about 461 to about 480 amino acids, from about 481 to about 500 amino acids, from about 
501 to about 520 amino acids, from about 521 to about 540 amino acids, from about 541 to 
about 560 amino acids, from about 561 to about 580 amino acids, from about 581 to about 
600 amino acids, from about 601 to about 620 amino acids, from about 621 to about 640 
amino acids, from about 641 to about 660 amino acids, from about 661 to about 680 amino 
acids, from about 681 to about 700 amino acids, from about 701 to about 720 amino acids, 
from about 721 to about 740 amino acids, from about 741 to about 760 amino acids, from 
about 761 to about 780 amino acids, from about 781 to about 800 amino acids, from about 
801 to about 820 amino acids, from about 821 to about 840 amino acids, from about 841 to 
about 860 amino acids, from about 861 to about 880 amino acids, from about 881 to about 
900 amino acids, from about 901 to about 920 amino acids, from about 921 to about 940 
amino acids, from about 941 to about 960 amino acids, from about 961 to about 980 amino 
acids, from about 981 to about 1000 amino acids, from about 1001 to about 1020 amino 
acids, from about 1021 to about 1040 amino acids, from about 1041 to about 1060 amino 
acids, from about 1061 to about 1080 amino acids, from about 1081 to about 1100 amino 
acids, from about 1101 to about 1120 amino acids, from about 1121 to about 1140 amino 
acids, from about 1141 to about 1160 amino acids, or from about 1161 to about 1165 
amino acids, and any combinations thereof. The fragment can be located within any 
portion of SEQ ID NO:5. 

In another preferred embodiment of the invention, the nucleic acid 
molecule comprises a nucleotide sequence that encodes a polypeptide comprising an 
amino acid sequence homologous to SEQ ID NO:5. Alternatively, the nucleic acid 
molecule comprises a nucleotide sequence that encodes a fragment of the polypeptide 
comprising an amino acid sequence homologous to SEQ ID NO:5. Preferably, the 
fragment comprises from about 5 to about 20 amino acids, from about 21 to about 40 
amino acids, from about 41 to about 60 amino acids, from about 61 to about 80 amino 
acids, from about 81 to about 100 amino acids, from about 101 to about 120 amino acids, 
from about 121 to about 140 amino acids, from about 141 to about 160 amino acids, from 
about 161 to about 180 amino acids, from about 181 to about 200 amino acids, from about 
201 to about 220 amino acids, from about 221 to about 240 amino acids, from about 241 to 
about 260 amino acids, from about 261 to about 280 amino acids, from about 281 to about 
300 amino acids, from about 301 to about 320 amino acids, from about 321 to about 340 
amino acids, from about 341 to about 360 amino acids, from about 361 to about 380 amino 
acids, from about 381 to about 400 amino acids, from about 401 to about 420 amino acids, 
from about 421 to about 440 amino acids, from about 441 to about 460 amino acids, from 
about 461 to about 480 amino acids, from about 481 to about 500 amino acids, from about 
501 to about 520 amino acids, from about 521 to about 540 amino acids, from about 541 to 
about 560 amino acids, from about 561 to about 580 amino acids, from about 581 to about 
600 amino acids, from about 601 to about 620 amino acids, from about 621 to about 640 
amino acids, from about 641 to about 660 amino acids, from about 661 to about 680 amino 
acids, from about 681 to about 700 amino acids, from about 701 to about 720 amino acids, 
from about 721 to about 740 amino acids, from about 741 to about 760 amino acids, from 
about 761 to about 780 amino acids, from about 781 to about 800 amino acids, from about 
801 to about 820 amino acids, from about 821 to about 840 amino acids, from about 841 to 
about 860 amino acids, from about 861 to about 880 amino acids, from about 881 to about 
900 amino acids, from about 901 to about 920 amino acids, from about 921 to about 940 
amino acids, from about 941 to about 960 amino acids, from about 961 to about 980 amino 
acids, from about 981 to about 1000 amino acids, from about 1001 to about 1020 amino 
acids, from about 1021 to about 1040 amino acids, from about 1041 to about 1060 amino 
acids, from about 1061 to about 1080 amino acids, from about 1081 to about 1100 amino 
acids, from about 1101 to about 1120 amino acids, from about 1121 to about 1140 amino 
acids, from about 1141 to about 1160 amino acids, or from about 1161 to about 1165 
amino acids, and any combinations thereof. The fragment can be located within any 
portion of SEQ ID NO:5. 

With the knowledge of the nucleotide sequence information disclosed in the 
present invention, one skilled in the art can identify and obtain nucleotide sequences which 
encode THP from different sources (i.e., different tissues or different organisms) through a 
variety of means well known to the skilled artisan and disclosed by, for example, 
Sambrook et al., "Molecular cloning: a laboratory manual", Second Edition, Cold Spring 
Harbor Press, Cold Spring Harbor, NY (1989), which is incorporated herein by reference 
in its entirety. 

For example, DNA which encodes the THP may be obtained by screening 
of mRNA, cDNA, or genomic DNA with oligonucleotide probes generated from the Thp 
gene sequence information provided herein. Probes may be labeled with a detectable 
group, such as a fluorescent group, a radioactive atom or a chemiluminescent group in 
accordance with procedures known to the skilled artisan and used in conventional 
hybridization assays, as described by, for example, Sambrook et al. 

A nucleic acid molecule comprising any of the Thp nucleotide sequences 
described above can alternatively be recovered by use of the polymerase chain reaction 
(PCR) procedure, with the PCR oligonucleotide primers produced from the nucleotide 
sequences provided herein. See U.S. Patent Numbers 4,683,195 to Mullis et al. and 
4,683,202 to Mullis. The PCR reaction provides a method for selectively increasing the 
concentration of a particular nucleic acid sequence even when that sequence has not been 
previously purified and is present only in a single copy in a particular sample. The method 
can be used to amplify either single- or double-stranded DNA. The essence of the method 
involves the use of two oligonucleotide probes to serve as primers for the 
template-dependent, polymerase-mediated replication of a desired nucleic acid molecule. 
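
As a rough illustration of how two primers delimit the amplified region, the following Python sketch locates the product on a single template strand. It is a simplification under stated assumptions: exact primer matches only, a made-up template in the usage note, and idealised doubling per cycle. The names `reverse_complement`, `amplicon` and `ideal_copies` are hypothetical and do not come from the cited patents.

```python
# Map each DNA base to its complement.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.translate(_COMPLEMENT)[::-1]


def amplicon(template, forward_primer, reverse_primer):
    """Return the region of `template` bounded by the two primers, or None.

    The forward primer must match the given strand exactly; the reverse
    primer must match the opposite strand, so its reverse complement is
    searched for downstream of the forward primer site.
    """
    start = template.find(forward_primer)
    if start < 0:
        return None
    downstream_site = reverse_complement(reverse_primer)
    end = template.find(downstream_site, start + len(forward_primer))
    if end < 0:
        return None
    return template[start:end + len(downstream_site)]


def ideal_copies(cycles, starting_copies=1):
    """Copies after `cycles` rounds, assuming perfect doubling each cycle."""
    return starting_copies * 2 ** cycles
```

With the illustrative template `"TTTACGTGCAAAGGGCCCTTTGACTGAAAA"`, forward primer `"ACGTGC"` and reverse primer `"CAGTCA"`, the function returns the stretch running from the forward primer site through the reverse primer site. Real reactions fall short of perfect doubling, so `ideal_copies` is an upper bound only.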

A wide variety of alternative cloning and in vitro amplification 
methodologies are well known to those skilled in the art. Examples of these techniques 
are found in, for example, Berger et al., Guide to Molecular Cloning Techniques, Methods 
in Enzymology 152, Academic Press, Inc., San Diego, CA (Berger), which is incorporated 
herein by reference in its entirety. 

The nucleic acid molecules of the present invention, and fragments derived 
therefrom, are useful for screening for restriction fragment length polymorphism (RFLP) 
associated with certain disorders, as well as for genetic mapping. 

Antisense oligonucleotides, or fragments of SEQ ID NO:3 or SEQ ID 
NO:4, or sequences complementary thereto, derived from the nucleotide sequences of the 
present invention encoding THP are useful as diagnostic tools for probing Thp gene 
expression in various tissues. For example, tissue can be probed in situ with 
oligonucleotide probes carrying detectable groups by conventional autoradiography 
techniques to investigate native expression of this enzyme or pathological conditions 
relating thereto. Antisense oligonucleotides are preferably directed to regulatory regions 
of SEQ ID NO:3 or SEQ ID NO:4 or mRNA corresponding thereto, including, but not 
limited to, the initiation codon, TATA box, enhancer sequences, and the like. 

Automated sequencing methods were used to obtain or verify the 
nucleotide sequence of THP. The THP nucleotide sequences of the present invention were 
obtained for both DNA strands, and are believed to be 100% accurate. However, as is 
known in the art, nucleotide sequence obtained by automated methods may contain some 
errors. Nucleotide sequences determined by automation are typically at least about 90%, 
more typically at least about 95% to at least about 99.9% identical to the actual nucleotide 
sequence of a given nucleic acid molecule. The actual sequence may be more precisely 
determined using manual sequencing methods, which are well known in the art. An error 
in sequence which results in an insertion or deletion of one or more nucleotides may result 
in a frame shift in translation such that the predicted amino acid sequence will differ from 
that which would be predicted from the actual nucleotide sequence of the nucleic acid 
molecule, starting at the point of the mutation. 
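
The effect of a single-nucleotide sequencing error on the predicted translation can be sketched in Python as follows. The short sequences in the usage note are invented for illustration and are not taken from SEQ ID NO:3 or SEQ ID NO:4; `translate_dna` and `first_divergence` are hypothetical helper names, and the codon table is the standard genetic code written in its usual compact form.

```python
# Standard genetic code in compact form: bases ordered T, C, A, G at each
# of the three codon positions; "*" marks a stop codon.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
_TABLE = {
    a + b + c: _AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(_BASES)
    for j, b in enumerate(_BASES)
    for k, c in enumerate(_BASES)
}


def translate_dna(dna):
    """Translate a DNA coding strand in frame 0, ignoring trailing bases."""
    usable = len(dna) - len(dna) % 3
    return "".join(_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def first_divergence(protein_a, protein_b):
    """Index of the first residue at which two proteins differ, or None."""
    for index, (x, y) in enumerate(zip(protein_a, protein_b)):
        if x != y:
            return index
    if len(protein_a) != len(protein_b):
        return min(len(protein_a), len(protein_b))
    return None
```

For example, `translate_dna("ATGGCCAAAGGTTTC")` gives `"MAKGF"`, whereas inserting a single `G` after the sixth base gives `"MAERF"`: the two predictions agree up to the insertion and diverge from the third residue onward, as described above.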

Another aspect of the present invention is directed to vectors, or 
recombinant expression vectors, comprising any of the nucleic acid molecules described 
above. Vectors are used herein either to amplify DNA or RNA encoding THP and/or to 
express DNA which encodes THP. Preferred vectors include, but are not limited to, 
plasmids, phages, cosmids, viral particles or viruses, and integratable DNA fragments (i.e., 
fragments integratable into the host genome by homologous recombination). Preferred 
viral particles include, but are not limited to, adenoviruses, parvoviruses, herpesviruses, 
poxviruses, adeno-associated viruses, Semliki Forest viruses, vaccinia viruses, and 
retroviruses. Preferred expression vectors include, but are not limited to, pcDNA3 
(Invitrogen) and pSVL (Pharmacia Biotech). Other expression vectors include, but are not 
limited to, pSPORT vectors, pGEM vectors (Promega), pPROEX vectors (LTI, Bethesda, 
MD), Bluescript vectors (Stratagene), pQE vectors (Qiagen), pSE420 (Invitrogen), and 
pYES2 (Invitrogen). 

Preferred expression vectors are replicable DNA constructs in which a 
DNA sequence encoding THP is operably linked to suitable control sequences capable of 
effecting the expression of the THP in a suitable host. DNA regions are operably linked 
when they are functionally related to each other. For example, a promoter is operably 
linked to a coding sequence if it controls the transcription of the sequence. Amplification 
vectors do not require expression control domains, but rather need only the ability to 
replicate in a host, usually conferred by an origin of replication, and a selection gene to 
facilitate recognition of transformants. The need for control sequences in the expression 
vector will vary depending upon the host selected and the transformation method chosen. 
Generally, control sequences include a transcriptional promoter, an optional operator 
sequence to control transcription, a sequence encoding suitable mRNA ribosomal binding, 
and sequences which control the termination of transcription and translation. 

Preferred vectors contain a promoter which is recognised by the 
host organism. The promoter sequences of the present invention may be 
prokaryotic, eukaryotic or viral. Examples of suitable prokaryotic sequences include the 
PR and PL promoters of bacteriophage lambda (The Bacteriophage Lambda, Hershey, A. 
D., Ed., Cold Spring Harbor Press, Cold Spring Harbor, NY (1973), which is incorporated 
herein by reference in its entirety; Lambda II, Hendrix, R. W., Ed., Cold Spring Harbor 
Press, Cold Spring Harbor, NY (1980), which is incorporated herein by reference in its 
entirety); the trp, recA, heat shock, and lacZ promoters of E. coli and the SV40 early 
promoter (Benoist et al., Nature, 1981, 290, 304-310, which is incorporated herein by 
reference in its entirety). Additional promoters include, but are not limited to, mouse 
mammary tumor virus, long terminal repeat of human immunodeficiency virus, Moloney 
virus, cytomegalovirus immediate early promoter, Epstein Barr virus, Rous sarcoma virus, 
human actin, human myosin, human hemoglobin, human muscle creatine, and human 
metallothionein. 



Additional regulatory sequences can also be included in preferred vectors. 
Preferred examples of suitable regulatory sequences are represented by the 
Shine-Dalgarno sequence of the replicase gene of the phage MS-2 and of the gene cII of 
bacteriophage lambda. The Shine-Dalgarno sequence may be directly followed by the DNA encoding 
THP and result in the expression of the mature THP protein. 

Moreover, suitable expression vectors can include an appropriate marker 

selected host is carried out using any one of the various techniques well known to the 
expert in the art and described in Sambrook et al., supra. 

An origin of replication can also be provided either by construction of the 
vector to include an exogenous origin or may be provided by the host cell chromosomal 
replication mechanism. If the vector is integrated into the host cell chromosome, the latter 
may be sufficient. Alternatively, rather than using vectors which contain viral origins of 
replication, one skilled in the art can transform mammalian cells by the method of 
co-transformation with a selectable marker and THP DNA. An example of a suitable marker 
is dihydrofolate reductase (DHFR) or thymidine kinase (see, U.S. Patent No. 4,399,216). 

Nucleotide sequences encoding THP may be recombined with vector DNA 
in accordance with conventional techniques, including blunt-ended or staggered-ended 
termini for ligation, restriction enzyme digestion to provide appropriate termini, filling in 
of cohesive ends as appropriate, alkaline phosphatase treatment to avoid undesirable 
joining, and ligation with appropriate ligases. Techniques for such manipulation are 
disclosed by Sambrook et al., supra, and are well known in the art. Methods for 
construction of mammalian expression vectors a^ 

al., Mol. Cell. Biol., 1983, 3, 280, Cosman et al., Mol. Immunol., 1986, 23, 935, Cosman et 
al., Nature, 1984, 312, 768, EP-A-0367566, and WO 91/18982, each of which is 
incorporated herein by reference in its entirety. 

Another aspect of the present invention is directed to transformed host cells 
having an expression vector comprising any of the nucleic acid molecules described 
above. Expression of the nucleotide sequence occurs when the expression vector is 
introduced into an appropriate host cell. Suitable host cells for expression of the 
polypeptides of the invention include, but are not limited to, prokaryotes, yeast, and 
eukaryotes. If a prokaryotic expression vector is employed, then the appropriate host cell 
would be any prokaryotic cell capable of expressing the cloned sequences. Suitable 
prokaryotic cells include, but are not limited to, bacteria of the genera Escherichia, 
Bacillus, Salmonella, Pseudomonas, Streptomyces, and Staphylococcus. 

If a eukaryotic expression vector is employed, then the appropriate host cell 
would be any eukaryotic cell capable of expressing the cloned sequence. Preferably, 
eukaryotic cells are cells of higher eukaryotes. Suitable eukaryotic cells include, but are 
not limited to, non-human mammalian tissue culture cells and human tissue culture cells. 
Preferred host cells include, but are not limited to, insect cells, HeLa cells, Chinese 
hamster ovary cells (CHO cells), African green monkey kidney cells (COS cells), human 
293 cells, and murine 3T3 fibroblasts. Propagation of such cells in cell culture has become 
a routine procedure (see, Tissue Culture, Academic Press, Kruse and Patterson, eds. 
(1973), which is incorporated herein by reference in its entirety). 

In addition, a yeast host may be employed as a host cell. Preferred yeast 
cells include, but are not limited to, the genera Saccharomyces, Pichia, and Kluyveromyces. 
Preferred yeast hosts are S. cerevisiae and P. pastoris. Preferred yeast vectors can contain 
an origin of replication sequence from a 2μ yeast plasmid, an autonomously replicating 
sequence (ARS), a promoter region, sequences for polyadenylation, sequences for 
transcription termination, and a selectable marker gene. Shuttle vectors for replication in 
both yeast and E. coli are also included herein. 

Alternatively, insect cells may be used as host cells. In a preferred 
embodiment, the polypeptides of the invention are expressed using a baculovirus 
expression system (see, Luckow et al., Bio/Technology, 1988, 6, 47, Baculovirus 
Expression Vectors: A Laboratory Manual, O'Reilly et al. (Eds.), W.H. Freeman and 
Company, New York, 1992, and U.S. Patent No. 4,879,236, each of which is incorporated 
herein by reference in its entirety). In addition, the MAXBAC™ complete baculovirus 
expression system (Invitrogen) can, for example, be used for production in insect cells. 

Another aspect of the present invention is directed to compositions, 
including pharmaceutical compositions, comprising any of the nucleic acid molecules or 
recombinant expression vectors described above and an acceptable carrier or diluent. 
Preferably, the carrier or diluent is pharmaceutically acceptable. Suitable carriers are 
described in the most recent edition of Remington's Pharmaceutical Sciences, A. Osol, a 
standard reference text in this field, which is incorporated herein by reference in its 
entirety. Preferred examples of such carriers or diluents include, but are not limited to, 
water, saline, Ringer's solution, dextrose solution, and 5% human serum albumin. 
Liposomes and nonaqueous vehicles such as fixed oils may also be used. The 
formulations are sterilized by commonly used techniques. 

Another aspect of the present invention is directed to an isolated 
polypeptide encoded by a nucleic acid molecule described above. In preferred 
embodiments of the invention, the isolated polypeptide comprises the amino acid sequence 
set forth in SEQ ID NO:5. Alternatively, the polypeptide is a fragment of the polypeptide 
encoding SEQ ID NO:5. Preferably, the fragment comprises from about 5 to about 20 
amino acids, from about 21 to about 40 amino acids, from about 41 to about 60 amino 
acids, from about 61 to about 80 amino acids, from about 81 to about 100 amino acids, 
from about 101 to about 120 amino acids, from about 121 to about 140 amino acids, from 
about 141 to about 160 amino acids, from about 161 to about 180 amino acids, from about 
181 to about 200 amino acids, from about 201 to about 220 amino acids, from about 221 to 
about 240 amino acids, from about 241 to about 260 amino acids, from about 261 to about 
280 amino acids, from about 281 to about 300 amino acids, from about 301 to about 320 
amino acids, from about 321 to about 340 amino acids, from about 341 to about 360 amino 
acids, from about 361 to about 380 amino acids, from about 381 to about 400 amino acids, 
from about 401 to about 420 amino acids, from about 421 to about 440 amino acids, from 
about 441 to about 460 amino acids, from about 461 to about 480 amino acids, from about 
481 to about 500 amino acids, from about 501 to about 520 amino acids, from about 521 to 
about 540 amino acids, from about 541 to about 560 amino acids, from about 561 to about 
580 amino acids, from about 581 to about 600 amino acids, from about 601 to about 620 
amino acids, from about 621 to about 640 amino acids, from about 641 to about 660 amino 
acids, from about 661 to about 680 amino acids, from about 681 to about 700 amino acids, 
from about 701 to about 720 amino acids, from about 721 to about 740 amino acids, from 
about 741 to about 760 amino acids, from about 761 to about 780 amino acids, from about 
781 to about 800 amino acids, from about 801 to about 820 amino acids, from about 821 to 
about 840 amino acids, from about 841 to about 860 amino acids, from about 861 to about 
880 amino acids, from about 881 to about 900 amino acids, from about 901 to about 920 
amino acids, from about 921 to about 940 amino acids, from about 941 to about 960 amino 
acids, from about 961 to about 980 amino acids, from about 981 to about 1000 amino 
acids, from about 1001 to about 1020 amino acids, from about 1021 to about 1040 amino 
acids, from about 1041 to about 1060 amino acids, from about 1061 to about 1080 amino 
acids, from about 1081 to about 1100 amino acids, from about 1101 to about 1120 amino 
acids, from about 1121 to about 1140 amino acids, from about 1141 to about 1160 amino 
acids, or from about 1161 to about 1165 amino acids, and any combinations thereof. The 
fragment can be located within any portion of SEQ ID NO:5. 

In another preferred embodiment of the invention, the polypeptide 
comprises an amino acid sequence homologous to SEQ ID NO:5 or a fragment thereof as 
described above. It is to be understood that the present invention includes proteins 
homologous to, and having essentially at least one biological property that is substantially 
similar to a biological property of the polypeptide. Preferably, the extent of the activity of 
the biological property is at least 10%, more preferably at least 20%, more preferably at 
least 30%, more preferably at least 40%, more preferably at least 50%, more preferably at 
least 60%, more preferably at least 70%, more preferably at least 80%, more preferably at 
least 90%, and most preferably 100% of the activity of the biological property of THP 
(e.g., SEQ ID NO:5). Such proteins are also called variants. This definition is intended to 
encompass isoforms, natural allelic variants, and splice variants of the Thp gene described 
herein. These variant forms may result from, for example, alternative splicing or 
differential expression in different tissue of the same source organism. The variant forms 
may be characterised by, for example, amino acid insertion(s), deletion(s) or 
substitution(s). In this connection, a variant form having an amino acid sequence which 
has at least about 70% sequence homology, at least about 80% sequence homology, 
preferably about 90% sequence homology, more preferably about 95% sequence 
homology and most preferably about 98% sequence homology to SEQ ID NO:5, is 
contemplated as being included in the present invention. A preferred homologous 
polypeptide comprises at least one conservative amino acid substitution compared to SEQ 
ID NO:5. Amino acid "insertions", "substitutions" or "deletions" are changes to or within 
an amino acid sequence. The variation allowed in a particular amino acid sequence may 
be experimentally determined by producing the peptide synthetically or by systematically 
making insertions, deletions, or substitutions of nucleotides in the Thp sequence using 
recombinant DNA techniques. 
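
The percentage thresholds above can be made concrete with a minimal Python sketch. It rests on assumptions that the text does not state: that "sequence homology" is measured as percent identity, and that the two sequences have already been aligned to equal length with "-" as the gap character. The function names `percent_identity` and `meets_threshold` are hypothetical.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent of alignment columns in which both sequences carry the same residue.

    Columns where both sequences have a gap are ignored; columns where
    only one sequence has a gap count as mismatches.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [
        (x, y) for x, y in zip(aligned_a, aligned_b)
        if not (x == gap and y == gap)
    ]
    if not columns:
        return 0.0
    matches = sum(1 for x, y in columns if x == y)
    return 100.0 * matches / len(columns)


def meets_threshold(aligned_a, aligned_b, threshold_percent):
    """True if the identity of the two aligned sequences reaches the threshold."""
    return percent_identity(aligned_a, aligned_b) >= threshold_percent
```

Other definitions (for example, similarity scoring with a substitution matrix, or a different treatment of gaps) would give different numbers, so the choice of measure should be stated whenever a threshold is applied.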

Alterations of the naturally occurring amino acid sequence can be 
accomplished by any of a number of known techniques. For example, mutations can be 
introduced into the polynucleotide encoding a polypeptide at particular locations by 
procedures well known to the skilled artisan, such as oligonucleotide-directed 
mutagenesis, which is described by Walder et al., Gene, 1986, 42, 133, Bauer et al., Gene, 
1985, 37, 73, Craik, BioTechniques, January 1985, pp. 12-19, Smith et al., Genetic 
Engineering: Principles and Methods, Plenum Press (1981), and U.S. Patent Numbers 
4,518,584 and 4,737,462, each of which is incorporated herein by reference in its entirety. 

Preferably, a THP variant of the present invention will exhibit substantially 
the biological activity of a naturally occurring THP polypeptide. By "exhibit substantially 
the biological activity of a naturally occurring THP polypeptide" is meant that THP 
variants within the scope of the invention can comprise conservatively substituted 
sequences, meaning that one or more amino acid residues of a THP polypeptide are 
replaced by different residues that do not alter the secondary and/or tertiary structure of the 
THP polypeptide. Such substitutions may include the replacement of an amino acid by a 
residue having similar physicochemical properties, such as substituting one aliphatic 
residue (Ile, Val, Leu or Ala) for another, or substitution between basic residues Lys and 
Arg, acidic residues Glu and Asp, amide residues Gln and Asn, hydroxyl residues Ser and 
Tyr, or aromatic residues Phe and Tyr. Further information regarding making 
phenotypically silent amino acid exchanges can be found in Bowie et al., Science, 1990, 
247, 1306-1310, which is incorporated herein by reference in its entirety. Other THP 
variants which might retain substantially the biological activities of THP are those where 
amino acid substitutions have been made in areas outside functional regions of the protein. 
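
The residue groupings listed above can be turned into a simple check, sketched here in Python. The groups are transcribed from the preceding paragraph using one-letter symbols; the function names `is_conservative` and `substitutions`, and the short sequences in the usage note, are hypothetical and serve only as an illustration. The check says nothing about whether a given exchange actually preserves structure or activity, which the text leaves to experiment.

```python
# Residue groups as listed in the text, in one-letter symbols.
CONSERVATIVE_GROUPS = {
    "aliphatic": {"I", "V", "L", "A"},
    "basic": {"K", "R"},
    "acidic": {"E", "D"},
    "amide": {"Q", "N"},
    "hydroxyl": {"S", "Y"},
    "aromatic": {"F", "Y"},
}


def is_conservative(original, replacement):
    """True if the two distinct residues share at least one listed group."""
    if original == replacement:
        return False
    return any(
        original in group and replacement in group
        for group in CONSERVATIVE_GROUPS.values()
    )


def substitutions(reference, variant):
    """List (position, original, replacement, conservative) for each difference.

    Both sequences must have the same length; positions are 0-based.
    """
    if len(reference) != len(variant):
        raise ValueError("sequences must have equal length")
    return [
        (i, ref, var, is_conservative(ref, var))
        for i, (ref, var) in enumerate(zip(reference, variant))
        if ref != var
    ]
```

For instance, comparing `"MKLV"` with `"MRLD"` reports a Lys to Arg exchange at position 1, flagged as conservative, and a Val to Asp exchange at position 3, flagged as not conservative.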

The polypeptides to be expressed in such host cells may also be fusion 
proteins which include regions from heterologous proteins. Such regions may be included 
to allow, e.g., secretion, improved stability, or facilitated purification of the polypeptide. 
For example, a sequence encoding an appropriate signal peptide can be incorporated into 

in-frame to the polynucleotide sequence so that the polypeptide is translated as a fusion 
protein comprising the signal peptide. A signal peptide that is functional in the intended 
host cell promotes extracellular secretion of the polypeptide. Preferably, the signal 
sequence will be cleaved from the polypeptide upon secretion of the polypeptide from the 
cell. Thus, preferred fusion proteins can be produced in which the N-terminus of THP is 
fused to a carrier peptide. 

In one embodiment, the polypeptide comprises a fusion protein which 
includes a heterologous [...] Many of 

the available peptides used for such a function allow selective binding of the fusion protein 
to a binding partner. A preferred binding partner includes one or more of the IgG binding 
domains of protein A are easily purified to homogeneity by affinity chromatography on, 
for example, IgG-coupled Sepharose. Alternatively, many vectors have the advantage of 
carrying a stretch of histidine residues that can be expressed at the N-terminal or 
C-terminal end of the target protein. Thus the protein of interest can be recovered by metal 
chelation chromatography. A nucleotide sequence encoding a recognition site for a 
proteolytic enzyme such as enterokinase, factor X, procollagenase or thrombin may 
immediately precede the sequence for THP to permit cleavage of the fusion protein to 
obtain the mature THP protein. Additional examples of fusion partners include, but are 
not limited to, the yeast α-factor, the honeybee melittin leader in Sf9 insect cells, 6-His tag, 
thioredoxin tag, hemagglutinin tag, GST tag, and OmpA signal sequence tag. As will be 
understood by one of skill in the art, the binding partner which recognizes and binds to the 
peptide may be any molecule or compound including metal ions (e.g., metal affinity 
columns), antibodies, or fragments thereof, and any protein or peptide which binds the 
peptide, such as the FLAG tag. 

The polypeptides of the invention can be used as antigens for raising 
antibodies against the same and used to screen for compounds that modulate the activity of 
THP. THP can also be used in compositions. Accordingly, the invention relates to THP 
or an antibody according to the invention for use as a medicament as well as to the use of 
the molecules in the manufacture of a medicament directed towards cancers or tumors, 
such as, for example, basal cell carcinomas. The molecules used as medicaments according 
to the invention may be the polypeptides or antibodies described herein as well as any 
novel substance identified in a screening method described herein. 

In another aspect, the invention provides THP polypeptides with or without 
associated native pattern glycosylation, acylation, sialylation, or other post-translational 
modifications. THP expressed in yeast or mammalian expression systems (discussed 
below) may be similar to or significantly different from a native THP polypeptide in 
molecular weight and glycosylation pattern. Of course, expression of THP in bacterial 
expression systems will provide non-glycosylated THP. 

Another aspect of the present invention is directed to compositions, 
including pharmaceutical compositions, comprising any of the polypeptides described 
above and an acceptable carrier or diluent. Preferably, the carrier or diluent is 
pharmaceutically acceptable. Compositions comprising a polypeptide, as described above, 
can be used to, for example, induce antibody formation and to induce an immune response 
for use in, for example, vaccine preparations. 

Another aspect of the present invention is directed to methods of producing 
a polypeptide comprising SEQ ID NO:5, or a homolog or fragment thereof, comprising 
introducing any of the recombinant expression vectors described above into compatible 
host cells, growing the host cells under conditions for expression of the polypeptide, and 
recovering the polypeptide from the host cells. Eukaryotic systems are preferred since 
they provide a variety of processing mechanisms which result in, for example, 
glycosylation, carboxy-terminal amidation, oxidation or derivatization of certain amino 
acid residues, conformational control, and so forth. 

The polypeptides of the present invention are preferably provided in an 
isolated form, are preferably substantially purified, and most preferably are purified to 
homogeneity. Host cells are preferably lysed and the polypeptide is recovered from the 
lysate of the host cells. Alternatively, the polypeptide is recovered by purifying the cell 
culture medium from the host cells, preferably without lysing the host cell. The 
polypeptides can be recovered and purified from recombinant cell cultures by well-known 
methods, including ammonium sulfate or ethanol precipitation, anion or cation exchange 
chromatography, phosphocellulose chromatography, hydrophobic interaction 
chromatography, affinity chromatography, hydroxylapatite chromatography and lectin 
chromatography. 

In addition to producing these proteins by recombinant techniques, 
automated amino acid synthesizers may also be employed to produce THP polypeptides, 
or fragments or homologous proteins thereof. 

Another aspect of the present invention is directed to an antibody or 
antibodies which bind to an epitope on any of the polypeptides described herein. 
Preferably, the antibody binds to an epitope within SEQ ID NO:5. The antibodies 
according to the invention can be monoclonal or polyclonal and include individual, allelic, 
strain or species variants, or fragments thereof, both in their naturally occurring (full- 
length) forms and recombinant forms. Additionally, the antibodies are raised to the present 
proteins in either their native configuration or in non-native configurations. Anti-idiotypic 
antibodies can also be generated. Hybridomas which produce antibodies that bind to 
the polypeptides of the invention, and the antibodies themselves, are useful in the isolation 
and purification of the polypeptides. In addition, antibodies may be specific inhibitors of 
THP activity. Antibodies which specifically bind to the polypeptides of the invention can 
be used to purify the protein from natural sources using well known techniques and readily 
available starting materials. Such antibodies can also be used to purify the protein from 
material present when producing the protein by recombinant DNA methodology. 

Many methods of making antibodies are known to persons skilled in the art. 
For techniques for preparing monoclonal antibodies, see e.g. Stites et al. (eds.), Basic and 
Clinical Immunology (4th ed.), Lange Medical Publications, Los Altos, CA, which is 
incorporated herein by reference in its entirety, and references cited therein. Techniques 
that involve selection of libraries of recombinant antibodies in phage or similar vectors are 
described in Huse et al., Science, 1989, 246, 1275-1281, which is incorporated herein by 
reference in its entirety. The production of antibodies and the protein structures of 
complete, intact antibodies, Fab fragments and F(ab)2 fragments and the organization of 
the genetic sequences that encode such molecules are well known and are also described, 
for example, in Harlow, E. and D. Lane (1988) ANTIBODIES: A Laboratory Manual, 
Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, which is incorporated herein by 
reference. Briefly, for example, a polypeptide of the invention is injected into mice. The 
spleen of the mouse is removed, the spleen cells are isolated and fused with immortalized 
mouse cells. The hybrid cells, or hybridomas, are cultured and those cells which secrete 
antibodies are selected. The antibodies are analyzed and, if found to specifically bind to 
the polypeptide, the hybridoma which produces them is cultured to produce a continuous 
supply of antibodies. 

The present invention is also directed to kits, including pharmaceutical kits. 
The kits can comprise any of the THP nucleic acid molecules described above, any of the 
THP polypeptides described above, or any anti-THP antibody which binds to a 
polypeptide of the invention as described above, as well as a negative control. The kit 
preferably comprises additional components, such as, for example, instructions, solid 
support, reagents helpful for quantification, and the like. 

Another aspect of the present invention is directed to methods of inducing 
an immune response in a mammal against a polypeptide of the invention by administering 
to the mammal an amount of the polypeptide sufficient to induce an immune response. 
The amount will be dependent on the animal species, size of the animal, and the like but 
can be determined by those skilled in the art. 

Another aspect of the present invention is directed to methods of 
identifying compounds which bind to either THP or nucleic acid molecules encoding THP, 
comprising contacting THP, or a nucleic acid molecule encoding the same, with a 
compound, and determining whether the compound binds THP, or a nucleic acid molecule 
encoding the same. Binding can be determined by binding assays which are well known 
to the skilled artisan, including, but not limited to, gel-shift assays, Western blots, 
radiolabeled competition assays, phage-based expression cloning, co-fractionation by 
chromatography, co-precipitation, cross linking, interaction trap/two-hybrid analysis, 
southwestern analysis, ELISA, and the like, which are described in, for example, Current 
Protocols in Molecular Biology, 1999, John Wiley & Sons, NY, which is incorporated 
herein by reference in its entirety. The compounds to be screened (which may 
include compounds which are suspected to bind THP, or a nucleic acid molecule encoding 
the same) include, but are not limited to, compounds of extracellular, intracellular, 
biologic or chemical origin. The THP polypeptide or polynucleotide employed in such a 
test may either be free in solution, attached to a solid support, borne on a cell surface or 
located intracellularly. One skilled in the art can, for example, measure the formation of 
complexes between THP and the compound being tested. Alternatively, one skilled in the 
art can examine the diminution in complex formation between THP and its substrate 
caused by the compound being tested. 

Another aspect of the present invention is directed to methods of 
identifying compounds which modulate (i.e., increase or decrease) activity of THP 
comprising contacting THP with a compound, and determining whether the compound 
modifies activity of THP. The activity in the presence of the test compound is measured 
and compared to the activity in the absence of the test compound. Where the activity of 
the sample containing the test compound is higher than the activity in the sample lacking 
the test compound, the compound will have increased activity. Similarly, where the 
activity of the sample containing the test compound is lower than the activity in the sample 
lacking the test compound, the compound will have inhibited activity. 
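
The comparison described above can be sketched in a few lines of Python. This is an illustration only: the function name, the optional `tolerance` noise margin, and the numeric readings are hypothetical and are not taken from the specification, which simply compares the activity measured with and without the test compound.

```python
def classify_modulation(activity_with, activity_without, tolerance=0.0):
    """Compare THP activity measured with and without a test compound.

    `tolerance` is a hypothetical noise margin in the same units as the
    readings; with the default of 0.0 the two activities are compared directly.
    """
    difference = activity_with - activity_without
    if difference > tolerance:
        return "increased"   # activity is higher with the compound present
    if difference < -tolerance:
        return "inhibited"   # activity is lower with the compound present
    return "no change"


# Illustrative readings in arbitrary units (not experimental data).
print(classify_modulation(150.0, 100.0))
print(classify_modulation(40.0, 100.0))
```

In practice the margin would be chosen from replicate measurements and controls; that choice is outside what the text specifies.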

The present invention is particularly useful for screening compounds by 
using THP in any of a variety of drug screening techniques. The compounds to be 
screened (which may include compounds which are suspected to modulate THP 
activity) include, but are not limited to, compounds of extracellular, intracellular, 
biologic or chemical origin. The THP polypeptide employed in such a test may be in any 
form, preferably, free in solution, attached to a solid support, borne on a cell surface or 
located intracellularly. One skilled in the art can, for example, measure the formation of 
complexes between THP and the compound being tested. Alternatively, one skilled in the 
art can examine the diminution in complex formation between THP and its substrate 
caused by the compound being tested. 

The activity of THP polypeptide of the invention can be determined by, for 
example, measuring the ADP-ribosylating ability of the polypeptide in the presence and 
[...] is incorporated herein by reference in its entirety). In order to test for ADP-ribosylation 
activity, the polypeptide in question can be incubated with a radio-labeled substrate 
(32P-NAD+) and a suitable protein target (such as TRF1). Co-migration of radio-labeled 
product with the protein target upon gel electrophoresis would indicate a positive result for 
ADP-ribosylation of the protein target and hence PARP activity in the polypeptide in 
question. Observation of larger and variably sized ADP-ribosylated products, when the 
assay is run in the presence of excess 32P-NAD+, would also be expected for PARP 
catalytic activity. Negative controls are warranted to preclude the possibility of extraneous 
PARP activity from sources other than the polypeptide in question (i.e., perform the assay 
with inactivated polypeptide or in the absence of polypeptide). Confirmation of the 
presence of ADP-ribose polymers on the protein target can be achieved by combination of 
the ADP-ribosylated product with a monoclonal antibody to poly(ADP-ribose) and 
subsequent detection of the antigen/antibody complex. In addition, PARP-like activity can 
be verified by addition of a specific PARP inhibitor (i.e., 3-aminobenzamide) to assay runs, 
which would result in significant inhibition of ADP-ribosylation activity. 

In another embodiment of the invention, the activity of THP polypeptide of 
the invention can be determined by, for example, measuring ANK binding. The ANK 
domain in tankyrase binds to TRF1 and has significant structural homology with ankyrins, 
proteins which are generally involved in protein:protein interactions. Both the ANK 
domain of tankyrase and ankyrins contain 24 tandem arrays of the ANK motif of 33 amino 
acids and share subtle variations, including a shorter fifth ANK copy. So-called 
"ankyrin-repeat proteins" have significantly fewer ANK copies, emphasizing the fact that 
tankyrase is a true ankyrin. Amino acid sequence analysis can be a primary means of 
identifying an ANK functional domain in any polypeptide and various binding and 
immunological assays (i.e., immunoprecipitation) can potentially identify an ANK domain. 
In addition, TRF1 binding assays can be performed with the polypeptides of the invention 
in the presence and absence of the test compound. 

Other assays can be used to examine enzymatic activity including, but not 
limited to, photometric, radiometric, HPLC, electrochemical, and the like, which are 
described in, for example, Enzyme Assays: A Practical Approach, eds. R. Eisenthal and M. 
J. Danson, 1992, Oxford University Press, which is incorporated herein by reference in its 
entirety. Representative assays which can be used for determining an activity of THP are 
described in, for example, Smith et al., Science, 1998, 282, 1484-1487, Simonin et al., J. 
Biol. Chem., 1993, 268, 8529-8535, and U.S. Patent No. 5,874,444, each of which is 
incorporated herein by reference in its entirety. 

In preferred embodiments of the invention, methods of screening for 
compounds which modulate THP activity comprise contacting the compound with THP 
and assaying for the presence of a complex between the compound and THP. In such 
assays, THP is typically labeled. After suitable incubation, free THP is separated from 
that present in bound form, and the amount of free or uncomplexed label is a measure of 
the ability of the particular compound to bind to THP. 

In another embodiment of the invention, high throughput screening for 
compounds having suitable binding affinity to THP is employed. Briefly, large numbers 
of different small peptide test compounds are synthesised on a solid substrate. The peptide 
test compounds are contacted with THP and washed. Bound THP is then detected by 
methods well known in the art. 

Purified polypeptides of the invention can also be coated directly onto 
plates for use in the aforementioned drug screening techniques. In addition, non- 
neutralizing antibodies can be used to capture the protein and immobilize it on the solid 
support. 

Other embodiments of the invention comprise using competitive screening 
assays in which neutralizing antibodies capable of binding a polypeptide of the invention 
specifically compete with a test compound for binding to the polypeptide. In this manner, 
the antibodies can be used to detect the presence of any peptide which shares one or more 
antigenic determinants with THP. Radiolabeled competitive binding studies are described 
in A.H. Lin et al., Antimicrobial Agents and Chemotherapy, 1997, vol. 41, no. 10, pp. 
2127-2131, the disclosure of which is incorporated herein by reference in its entirety. 

In other embodiments of the invention, the polypeptides of the invention 
are employed as a research tool for identification, characterization and purification of 
interacting, regulatory proteins. Appropriate labels are incorporated into the polypeptides 
of the invention by various methods known in the art and the polypeptides are used to 
capture interacting molecules. For example, molecules are incubated with the labeled 
polypeptides, washed to remove unbound polypeptides, and the polypeptide complex is 
quantified. Data obtained using different concentrations of polypeptide are used to 
calculate values for the number, affinity, and association of polypeptide with the protein 
complex. 




Labeled polypeptides are also useful as reagents for the purification of 
molecules with which the polypeptide interacts including, but not limited to, inhibitors. In 
one embodiment of affinity purification, a polypeptide is covalently coupled to a 
chromatography column. Cells and their membranes are extracted, and various cellular 
subcomponents are passed over the column. Molecules bind to the column by virtue of 
their affinity to the polypeptide. The polypeptide-complex is recovered from the column, 
dissociated, and the recovered molecule is subjected to protein sequencing. This amino 
acid sequence is then used to identify the captured molecule or to design degenerate 
oligonucleotides for cloning the corresponding gene from an appropriate cDNA library. 

Alternatively, compounds may be identified which exhibit similar 
properties to THP of the invention, but which are smaller and exhibit a longer half-life 
than THP in a human or animal body. When an organic compound is designed, a 
molecule according to the invention is used as a "lead" compound. The design of 
mimetics to known pharmaceutically active compounds is a well known approach in the 
development of pharmaceuticals based on such "lead" compounds. Mimetic design, 
synthesis and testing are generally used to avoid randomly screening a large number of 
molecules for a target property. Furthermore, structural data deriving from the analysis of 
the deduced amino acid sequences encoded by the DNAs of the present invention are 
useful to design new drugs that are more specific and therefore have a higher 
pharmacological potency. 

Comparison of the protein sequence of the present invention with the 
sequences present in all the available data bases showed a significant homology with the 
enzymatic PARP domain. Accordingly, computer modelling can be used to develop a 
putative tertiary structure of the proteins of the invention based on the available 
information of other PARP domain proteins. Thus, novel enzyme inhibitors based on the 
predicted structure of THP can be designed. 

In a particular embodiment, the novel molecules identified by the screening 
methods according to the invention are low molecular weight organic molecules, in which 
case a composition or pharmaceutical composition can be prepared thereof for oral intake, 
such as in tablets. The compositions, or pharmaceutical compositions, comprising the 
nucleic acid molecules, vectors, polypeptides, antibodies and compounds identified by the 
screening methods described herein, can be prepared for any route of administration 
including, but not limited to, oral, intravenous, cutaneous, subcutaneous, nasal, 
intramuscular or intraperitoneal. The nature of the carrier or other ingredients will depend 
on the specific route of administration and particular embodiment of the invention to be 
administered. Examples of techniques and protocols that are useful in this context are, 
inter alia, found in Remington's Pharmaceutical Sciences, 16th edition, Osol, A (ed.), 
1980, which is incorporated herein by reference in its entirety. 

The dosage of these low molecular weight compounds will depend on the 
disease state or condition to be treated and other clinical factors such as weight and 
condition of the human or animal and the route of administration of the compound. For 
treating humans or animals, between approximately 0.5 mg/kg and 500 mg/kg of body 
weight of the compound can be administered. Therapy is typically 
administered at lower dosages and is continued until the desired therapeutic outcome is 
observed. 

The present compounds and methods, including nucleic acid molecules, 
polypeptides, antibodies, compounds identified by the screening methods described herein, 
have a variety of pharmaceutical applications and may be used, for example, to treat or 
prevent unregulated cellular growth, such as cancer cell and tumour growth. In a 
particular embodiment, the present molecules are used in gene therapy. For a review of 
gene therapy procedures, see e.g. Anderson, Science, 1992, 256, 808-813, which is 
incorporated herein by reference in its entirety. 

The invention is further illustrated by way of the following examples which 
are intended to elucidate the invention. These examples are not intended, nor are they to 
be construed, as limiting the scope of the invention. It will be clear that the invention may 
be practiced otherwise than as particularly described herein. Numerous modifications and 
variations of the present invention are possible in view of the teachings herein and, 
therefore, are within the scope of the invention. 

Examples 1-3 presented below are actual whereas Examples 4-8 are 
prophetic. 

EXAMPLES 

Example 1: Identification of Homolog of Human Tankyrase 

The nucleotide sequence of human tankyrase (GenBank accession number 
AF082556), or the amino acid sequence deduced from it, was used as a query sequence 
against the LifeSeq™ Assembled database (Incyte Pharmaceuticals). This database, 
which contains previously identified and annotated sequences, was searched for regions of 
similarity using Gapped BLAST (Altschul et al., Nuc. Acids Res., 1997, 25, 3389, which 
is incorporated herein by reference in its entirety). 

Briefly, the BLAST algorithm, which stands for Basic Local Alignment 
Search Tool, is suitable for determining sequence similarity (Altschul et al., J. Mol. Biol., 
1990, 215, 403-410, which is incorporated herein by reference in its entirety). Software 
for performing BLAST analyses is publicly available through the National Center for 
Biotechnology Information (http://www.ncbi.nlm.nih.gov/). This algorithm involves first 
identifying high scoring sequence pairs (HSPs) by identifying short words of length W in 
the query sequence that either match or satisfy some positive-valued threshold score T 
when aligned with a word of the same length in a database sequence. T is referred to as 
the neighbourhood word score threshold (Altschul et al., supra). These initial 
neighbourhood word hits act as seeds for initiating searches to find HSPs containing them. 
The word hits are extended in both directions along each sequence for as far as the 
cumulative alignment score can be increased. Extension of the word hits in each direction 
is halted when: 1) the cumulative alignment score falls off by the quantity X from its 
maximum achieved value; 2) the cumulative score goes to zero or below, due to the 
accumulation of one or more negative-scoring residue alignments; or 3) the end of either 
sequence is reached. The BLAST algorithm parameters W, T and X determine the sensitivity 
and speed of the alignment. The BLAST program uses as defaults a word length (W) of 11, 
the BLOSUM62 scoring matrix (see Henikoff et al., Proc. Natl. Acad. Sci. USA, 1992, 89, 
10915-10919, which is incorporated herein by reference in its entirety), alignments (B) of 
50, expectation (E) of 10, M=5, N=4, and a comparison of both strands. 
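
The seeding and extension steps described above can be sketched as follows. This is a simplified illustration under stated assumptions, not the BLAST implementation: seeds are restricted to exact word matches (the neighbourhood threshold T is not modelled), extension is ungapped and runs in one direction only, a match scores +5 and a mismatch costs 4 (an assumed reading of M and N), and the function names and toy sequences are hypothetical.

```python
def word_hits(query, subject, word_len):
    """Return (query_pos, subject_pos) pairs where a word of length
    word_len matches exactly (simplification: no threshold T)."""
    index = {}
    for j in range(len(subject) - word_len + 1):
        index.setdefault(subject[j:j + word_len], []).append(j)
    hits = []
    for i in range(len(query) - word_len + 1):
        for j in index.get(query[i:i + word_len], []):
            hits.append((i, j))
    return hits


def extend_right(query, subject, q_start, s_start, word_len, x_drop,
                 match=5, mismatch=-4):
    """Extend an exact word hit to the right without gaps.

    Stops when the running score has fallen by x_drop from its maximum,
    when the running score reaches zero or below, or when either sequence
    ends. Returns (best_score, aligned_length_at_best_score).
    """
    score = best = word_len * match  # the seed itself is an exact match
    best_len = word_len
    i = word_len
    while q_start + i < len(query) and s_start + i < len(subject):
        same = query[q_start + i] == subject[s_start + i]
        score += match if same else mismatch
        i += 1
        if score > best:
            best, best_len = score, i
        elif best - score >= x_drop or score <= 0:
            break
    return best, best_len


# Toy sequences (hypothetical); a word length of 4 keeps the example short.
query = "ACGTACGTTTTT"
subject = "ACGTACGTAAAA"
print(word_hits(query, subject, 4)[0])
print(extend_right(query, subject, 0, 0, word_len=4, x_drop=8))
```

A full implementation would also extend to the left, score protein words against a substitution matrix, and merge overlapping hits.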

The BLAST algorithm (Karlin et al., Proc. Natl. Acad. Sci. USA, 1993, 90, 
5873-5877, which is incorporated herein by reference in its entirety) and Gapped BLAST 
perform a statistical analysis of the similarity between two sequences. One measure of 
similarity provided by the BLAST algorithm is the smallest sum probability (P(N)), which 
provides an indication of the probability by which a match between two nucleotide or 
amino acid sequences would occur by chance. For example, a nucleic acid is considered 
similar to a Thp gene or cDNA if the smallest sum probability in comparison of the test 
nucleic acid to a Thp nucleic acid is less than about 1, preferably less than about 0.1, more 
preferably less than about 0.01, and most preferably less than about 0.001. 
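
The tiered cut-offs in the preceding paragraph can be expressed as a small lookup. This is a sketch only: the tier labels and the function name are hypothetical, and the text qualifies each cut-off with "about", so the strict comparisons below are an assumption.

```python
# Cut-offs taken from the paragraph above, ordered from strictest to loosest.
THRESHOLDS = [
    (0.001, "most preferred"),
    (0.01, "more preferred"),
    (0.1, "preferred"),
    (1.0, "similar"),
]


def similarity_tier(smallest_sum_probability):
    """Return the strictest tier whose cut-off the P(N) value falls below."""
    for cutoff, label in THRESHOLDS:
        if smallest_sum_probability < cutoff:
            return label
    return "not similar"


print(similarity_tier(0.0005))
print(similarity_tier(0.05))
```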

Search of the LifeSeq™ Assembled database with the tankyrase nucleotide 
sequence as a query sequence identified a template sequence, with ID # 232670.2, having 
a statistically significant overlapping homology to the query sequence. The sequence 
identified by template ID # 232670.2 had not previously been found to be homologous to 
human tankyrase. 

The overlapping similarity extended over tankyrase AF082556 residues 
2565 to 4165, compared to template 232670 residues 1 to 1589, aligning with an 
overall DNA sequence similarity of 70%. The deduced amino acid sequence of AF082556, 
residues 2565 to 4165, and template 232670, residues 1 to 1589, aligned with a similarity 
of 80%. The similarity overlap included residues of tankyrase encoding the two most 
C-terminal ANK repeats, the SAM domain, and the PARP domain. In addition, template 
232670.2 contained 2310 bp of sequence 3' to the overlap, not present in the published 
tankyrase sequence (AF082556). 

In the Incyte assembled database, the very 5' end sequence of template 
232670.2 is given by Incyte clone 3206491 from Incyte library UTRSTMR01. Clone 
3206491 was found to be 3697 bp long, and was sequenced fully by automated 
sequencing. The obtained sequence confirmed the sequence of template 232670.2 from nt. 
1 to 3697. 

Example 2: Cloning of Thp cDNA 

To isolate a cDNA clone encoding full length THP, a DNA fragment 
corresponding to clone 3206491 nt 1 to 660 was used as a probe for hybridization 
screening of a phage cDNA library. The DNA fragment was amplified by the polymerase 
chain reaction (PCR) method. The PCR reaction mixture of 50 µl contained polymerase 
mixture (0.2 mM dNTPs, 1x PCR Buffer and 0.75 µl Expand High Fidelity Polymerase 
(Roche Biochemicals)), 1 pg of 3206491 plasmid, and 50 pmoles of forward primer 
(GAGTATTTGTTACAACACGG) (SEQ ID NO:1) and 50 pmoles of reverse primer 
(AATCTCCTTCAGCTCCTT) (SEQ ID NO:2). Amplification was performed in an 
Applied Biosystems PE2400 thermocycler, using the following program: 95°C for 15 
seconds, 52°C for 30 seconds and 72°C for 90 seconds; repeated for 25 cycles. The 
amplified product was separated from the 3206491 plasmid by agarose gel electrophoresis, 
and purified by Qiaquick gel extraction kit (Qiagen). 

A lambda phage library containing cDNAs isolated from human brain 
(Stratagene cat. # 936213), cloned into lambda ZAP II phage vector, was plated with E. coli 
XL-1 blue host on 15 cm LB-agar plates at a density of 50,000 pfu per plate, and grown 
overnight at 37°C (plated as described by Sambrook et al., supra). Phage plaques were 
transferred to nylon membranes (Amersham Hybond N+), denatured for 2 minutes in 
denaturation solution (0.5 M NaOH, 1.5 M NaCl), renatured for 5 minutes in renaturation 
solution (1 M Tris pH 7.5, 1.5 M NaCl), and washed briefly in 2x SSC (20x SSC: 3 M 
NaCl, 0.3 M Na-citrate). Filter membranes were dried and incubated at 80°C for 120 
minutes to cross link the phage DNA to the membranes. 

The membranes were hybridized with a DNA probe prepared as described 
above. A DNA fragment (25 ng) was labeled with α-32P-dCTP (NEN) using Rediprime 
random priming (Amersham Pharmacia Biotech), according to the manufacturer's instructions. 
Labeled DNA was separated from unincorporated nucleotides by S200 spin columns 
(Amersham Pharmacia Biotech), denatured at 95°C for 5 minutes and kept on ice. The 
DNA-containing membranes (above) were pre-hybridised in 50 ml ExpressHyb (Clontech) 
solution at 68°C for 90 minutes. Subsequently, the labeled DNA probe was added to the 
hybridization solution, and the probe was left to hybridise to the membranes at 68°C for 
70 minutes. The membranes were washed five times in 2x SSC, 0.1% SDS at 42°C for 5 
minutes each, and finally washed 30 minutes in 0.1x SSC, 0.2% SDS. Filters were exposed 
to Kodak XAR film (Eastman Kodak Company, Rochester, N.Y., USA) with an 
intensifying screen at -80°C for 16 hours. One positive colony was isolated from the 
plates, and replated with about 1000 pfu on a 15 cm LB plate. Plating, plaque lift to filters 
and hybridization were [...] isolated from this secondary screening. 

cDNA-containing plasmids (pBluescript SK-) were rescued from the 
isolated phages by in vivo excision by culturing XL-1 blue cells co-infected with the 
isolated phages and with the Excision helper phage, as described by the manufacturer 
(Stratagene). XL-1 blue cells containing the plasmids were plated on LB plates and grown 
at 37°C for 16 hours. Colonies (18) from each plate were replated on LB plates and 
grown. One colony from each plate was streaked onto a nylon filter in an ordered array, 
and the filter was placed on an LB plate to raise the colonies. The filter was then hybridized 
with a labeled probe as described above. Three positive colonies were selected and grown 
up in LB medium. Plasmid DNA was isolated from the three clones by Qiagen Midi Kit 
(Qiagen) according to the manufacturer's instructions. The size of the insert was 
determined by digesting the plasmid with the restriction enzymes NotI and SalI, which 
established an insert size of approximately 4500 bp. The sequence of the entire insert was 
determined by automated sequencing on both strands of the plasmids, and is shown in 
SEQ ID NO:3. The sequence contains an open reading frame from positions 23 to 3520 
(SEQ ID NO:4), encoding a polypeptide (1166 amino acid residues long), as shown in 
SEQ ID NO:5. Plasmid containing SEQ ID NO:3 (pCF497) has been deposited with 
American Type Culture Collection (ATCC) on May 10, 1999 and has been given Deposit 
No. PTA-35. 
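
As a consistency check on the coordinates given above, the length of the open reading frame and the number of codons it spans can be computed directly. The helper name is hypothetical, and the positions are assumed to be 1-based and inclusive, which the text does not state explicitly.

```python
def orf_codons(start, end):
    """Return (length_in_nt, whole_codons, leftover_nt) for an ORF given
    by 1-based inclusive start and end positions."""
    length = end - start + 1
    codons, leftover = divmod(length, 3)
    return length, codons, leftover


# Positions 23 to 3520 of SEQ ID NO:3, as stated in Example 2.
print(orf_codons(23, 3520))  # (3498, 1166, 0)
```

Under that assumption the span is 3498 nucleotides, i.e. 1166 complete codons, which agrees with the stated length of 1166 amino acid residues.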

Example 3: Northern Blot Analysis 

Northern blots were performed to examine the expression of mRNA. The 
sense orientation oligonucleotide 5'-CCCGAGAGCTGTTCGAGGC (SEQ ID NO:6) and 
the antisense-orientation oligonucleotide 5'-CAATCTTTACTCTGTTATATCCT-3' 
(SEQ ID NO:7) were used as primers to amplify a portion of the THP cDNA sequence of 
SEQ ID NO:3. A fragment 606 base pairs long, from positions 96 to 701, was amplified and 
used as a probe. 

Multiple human tissue northern blots from Clontech (Human D # 7767-1) 
were hybridized with the probe. Pre-hybridization was carried out at 42°C for 4 hours in 
5x SSC, 1x Denhardt's reagent, 0.1% SDS, 50% formamide, 250 µg/ml salmon sperm 
DNA. Hybridization was performed overnight at 42°C in the same mixture with the 
addition of about 1.5x10^6 cpm/ml of labeled probe. 

The probe was labeled with α-32P-dCTP by Rediprime DNA labelling 
system (Amersham Pharmacia), purified on Nick Column (Amersham Pharmacia) and 
added to the hybridization solution. The filters were washed several times at 42°C in 0.2x 
SSC, 0.1% SDS. Filters were exposed to Kodak XAR film (Eastman Kodak Company, 
Rochester, N.Y., USA) with intensifying screen at -80°C. 

A single 7 kb mRNA is expressed in all tissues analysed, including heart, 
brain, placenta, lung, liver, skeletal muscle, kidney and pancreas. The expression is rather 
constant in all tissues, except for placenta and skeletal muscle, which show an 
approximately 2-fold increase in expression levels over the other tissues. Equal loading of 
all the lanes was verified by filter hybridisation with a human GAPDH probe (data not 
shown). 

Example 4: Expression of THP in Mammalian Cells 

1. Expression of THP in 293 cells 

For expression of THP in 293 mammalian cells (transformed human 
primary embryonic kidney cells), a plasmid bearing the relevant THP coding sequence is 
prepared, using vector pcDNA3.1 myc-his (Invitrogen). The plasmid contains nucleotides 
17 through 3513 of SEQ ID NO:3. Vector pcDNA3.1 contains the c-myc epitope for 
detection of the recombinant protein with the anti-myc antibody, a C-terminal 
polyhistidine for purification with nickel chelate chromatography, and a neomycin 
resistance gene for selection of stable transfectants. The forward primer for amplification of 
this THP cDNA is: 
5'-AAGCGGCCGCATTATGGAAAGGATCATGTCGGGTCGCCGCT-3' (SEQ ID NO:8) 
which contains a 5' extension of 19 nucleotides to introduce the NotI cloning site and 22 
nucleotides matching the THP sequence (nucleotides 17 through 38 of SEQ ID NO:3). 
The reverse primer is: 5'-AAGGATCCACCATACCTTCAGGCCT-3' (SEQ ID NO:9) 
which contains a 5' extension of 8 nucleotides to introduce a BamHI restriction site for 
cloning and 17 nucleotides corresponding to the reverse complement of the THP sequence 
from bases 3497 to 3513 of SEQ ID NO:3. The PCR is performed with an annealing 
temperature of 55°C. The PCR product is gel purified and cloned into the NotI-BamHI 
sites of the vector. 
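
The primer layout described above (a 5' extension carrying a restriction site, followed by a template-matching region) can be checked with a short script. This is a sketch: the helper names are hypothetical, the primer strings are those given for SEQ ID NO:8 and SEQ ID NO:9, and the recognition sequences used for NotI (GCGGCCGC) and BamHI (GGATCC) are the standard ones rather than something stated in the text. SEQ ID NO:3 itself is not reproduced here, so the script does not verify the match to the template.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A/C/G/T only)."""
    return seq.translate(COMPLEMENT)[::-1]


def split_primer(primer, extension_len):
    """Split a primer into its 5' extension and its template-matching part."""
    return primer[:extension_len], primer[extension_len:]


forward = "AAGCGGCCGCATTATGGAAAGGATCATGTCGGGTCGCCGCT"  # SEQ ID NO:8
reverse = "AAGGATCCACCATACCTTCAGGCCT"                  # SEQ ID NO:9

fwd_ext, fwd_match = split_primer(forward, 19)
rev_ext, rev_match = split_primer(reverse, 8)

print(len(fwd_ext), len(fwd_match))   # 19 22
print("GCGGCCGC" in fwd_ext)          # True: NotI site in the extension
print("GGATCC" in rev_ext)            # True: BamHI site in the extension
# Sense-strand sequence the reverse primer's 17-nt region would anneal to:
print(reverse_complement(rev_match))  # AGGCCTGAAGGTATGGT
```

The last line gives the sense-strand sequence that, according to the text, should appear at bases 3497 to 3513 of SEQ ID NO:3.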

The DNA is purified using Qiagen chromatography columns and 
transfected into 293 cells using SUPERFECT transfection media (Qiagen). Transiently 
transfected cells are tested for expression after 24 hours of transfection, using western 
blots probed with anti-His and anti-THP peptide antibodies. Permanently transfected cells 
are selected with G418 and propagated. Production of the recombinant protein is detected 
from cells by Western blots probed with anti-His, anti-Myc or anti-THP peptide 
antibodies. 

2. Expression of THP in COS cells 

For expression of the THP in COS7 cells, a polynucleotide molecule having 
the sequence given as nucleotides 17 through 3513 of SEQ ID NO:3 was cloned into 
vector pSecTag2A. Vector pSecTag2A contains the murine Ig κ chain leader sequence for 
secretion, the c-myc epitope for detection of the recombinant protein with the anti-myc 
antibody, a C-terminal polyhistidine for purification with nickel chelate chromatography, 
and a Zeocin resistance gene for selection of stable transfectants. 

The forward primer is 
5'-AAAAGCTTTATGGAAAGGATCATGTCGGGTCGCCGCTGC-3' (SEQ ID NO:10). The first 15 
nucleotides of this primer constitute a 5' extension which introduces a HindIII 
restriction site for cloning, followed by 22 nucleotides which correspond to nucleotide 
residues 17-38 of the sequence given in SEQ ID NO:3. The reverse primer is SEQ ID NO:9. 
This primer contains a 5' extension of 8 nucleotides which introduces a BamHI cloning 
site followed by 17 nucleotides which correspond to the reverse complement of bases 
3497 to 3513 of the sequence given in SEQ ID NO:3. 

The PCR consists of an initial denaturation step of 5 min at 95°C, 30 cycles
of 30 sec denaturation at 95°C, 30 sec annealing at 58°C and 30 sec extension at 72°C,
followed by 5 min extension at 72°C. The PCR product is gel purified and ligated into the
XbaI and SalI sites of vector p3-CI. This construct is transformed into E. coli cells for
amplification and DNA purification. The DNA is purified with Qiagen chromatography
columns and transfected into COS 7 cells using Lipofectamine reagent from BRL,
following the manufacturer's protocols. Forty-eight and 72 hours after transfection, the
media and the cells are tested for recombinant protein expression.
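
The cycling program described above can be written down as data, which makes it easy to check the programmed run time. The sketch below is illustrative only: the `Step` class and `total_hold_seconds` function are hypothetical names, and the total counts programmed hold times only (ramp times between temperatures depend on the instrument and are not included).

```python
# Illustrative encoding of the PCR program from the text; names are hypothetical.
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    name: str
    temp_c: int    # hold temperature in degrees Celsius
    seconds: int   # hold time in seconds


INITIAL = [Step("initial denaturation", 95, 5 * 60)]
CYCLE = [
    Step("denaturation", 95, 30),
    Step("annealing", 58, 30),
    Step("extension", 72, 30),
]
FINAL = [Step("final extension", 72, 5 * 60)]
N_CYCLES = 30


def total_hold_seconds(initial, cycle, n_cycles, final) -> int:
    """Sum the programmed hold times, ignoring instrument ramp times."""
    per_cycle = sum(step.seconds for step in cycle)
    return (sum(step.seconds for step in initial)
            + n_cycles * per_cycle
            + sum(step.seconds for step in final))


total = total_hold_seconds(INITIAL, CYCLE, N_CYCLES, FINAL)
print(f"{total} s = {total / 60:.0f} min of programmed hold time")
```

With the values given in the text this comes to 3300 seconds, or 55 minutes, before ramping is taken into account.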

THP expressed from a COS cell culture can be purified by concentrating
the cell-growth media to about 10 mg of protein/ml, and purifying the protein by, for
example, chromatography. Purified THP is concentrated to 0.5 mg/ml in an Amicon
concentrator fitted with a YM-10 membrane and stored at -80°C.

Example 5: Expression of THP in Insect Cells 

For expression of THP in a baculovirus system, a polynucleotide molecule
having the sequence given as nucleotides 17 through 3513 of SEQ ID NO:3 was amplified
by PCR. The forward primer is SEQ ID NO:8. The first 19 nucleotides of this primer
constitute a 5' extension which adds the NotI cloning site, followed by 22
nucleotides which correspond to nucleotide residues 17 through 38 of the sequence given
in SEQ ID NO:3. The reverse primer is SEQ ID NO:9. The first 8 nucleotides of this
primer constitute a 5' extension which introduces the BamHI cloning site, followed by
17 nucleotides which correspond to the reverse complement of nucleotide
residues 3497 through 3513 of the sequence given in SEQ ID NO:3.

The PCR product is gel purified, digested with NdeI and KpnI, and cloned
into the corresponding sites of vector pAcHLT-A (Pharmingen, San Diego, CA). The
pAcHLT expression vector contains the strong polyhedrin promoter of the Autographa
californica nuclear polyhedrosis virus (AcMNPV), and a 6XHis tag upstream from the
multiple cloning site. A protein kinase site for phosphorylation and a thrombin site for
excision of the recombinant protein, both preceding the multiple cloning site, are also
present. Of course, many other baculovirus vectors could be used in place of pAcHLT-A, such as
pAc373, pVL941 and pAcIM1. Other suitable vectors for the expression of THP
polypeptides can be used, provided that the vector construct includes appropriately located
signals for transcription, translation, and trafficking, such as an in-frame AUG and a signal
peptide, as required. Such vectors are described in Luckow et al., Virology 170:31-39,
among others.

The virus is grown and isolated using standard baculovirus expression
methods, such as those described in Summers et al. (A Manual of Methods for Baculovirus
Vectors and Insect Cell Culture Procedures, Texas Agricultural Experimental Station
Bulletin No. 1555 (1987)).

In a preferred embodiment, pAcHLT-A containing the THP gene is
introduced into baculovirus using the "BaculoGold" transfection kit (Pharmingen, San
Diego, CA) using methods established by the manufacturer. Individual virus isolates are
analyzed for protein production by radiolabeling infected cells with 35S-methionine at 24
hours post infection. Infected cells are harvested at 48 hours post infection, and the
labeled proteins are visualized by SDS-PAGE. Viruses exhibiting high expression levels
can be isolated and used for scaled up expression.

For expression of the THP polypeptide in Sf9 cells, a polynucleotide
molecule having the sequence given as nucleotides 17 through 3513 of SEQ ID NO:3 is
amplified by PCR using the primers and methods described above for baculovirus
expression. The THP cDNA is cloned into vector pAcHLT-A (Pharmingen) for
expression in Sf9 insect cells. The insert is cloned into the NotI and BamHI sites, after
elimination of an internal NdeI site (using the same primers described above for
expression in baculovirus). DNA is purified with Qiagen chromatography columns and
expressed in Sf9 cells. In preliminary Western blot experiments, non-purified plaques
are tested for the presence of a recombinant protein of the expected size that reacts
with the THP-specific antibody. These results are confirmed after further purification and
expression optimization in HiG5 cells.

Example 6: Interaction Trap/Two-Hybrid System 

In order to assay for THP-interacting proteins, the interaction trap/two-hybrid
library screening method can be used. This assay was first described in Fields, et
al., Nature, 1989, 340, 245, which is incorporated herein by reference in its entirety. A
protocol is published in Current Protocols in Molecular Biology 1999, John Wiley &
Sons, NY and Ausubel, F. M. et al. 1992, Short protocols in molecular biology, fourth
edition, Greene and Wiley-interscience, NY, each of which is incorporated herein by reference in
its entirety. Kits are available from Clontech, Palo Alto, CA (Matchmaker Two-Hybrid
System 3).

A fusion of the nucleotide sequences encoding all or part of THP and the
yeast transcription factor GAL4 DNA-binding domain (DNA-BD) is constructed in an
appropriate plasmid (i.e., pGBKT7) using standard subcloning techniques. Similarly, a
GAL4 activation domain (AD) fusion library is constructed in a second plasmid (i.e., pGADT7)
from cDNA of potential THP-binding proteins (for protocols on forming cDNA libraries,
see Sambrook et al. 1989, Molecular cloning: a laboratory manual, second edition, Cold
Spring Harbor Press, Cold Spring Harbor, NY, which is incorporated herein by reference
in its entirety). The DNA-BD/THP fusion construct is verified by sequencing, and tested
for autonomous reporter gene activation and cell toxicity, both of which would prevent a
successful two-hybrid analysis. Similar controls are performed with the AD/library fusion
construct to ensure expression in host cells and lack of transcriptional activity. Yeast cells
are transformed (ca. 10^5 transformants/mg DNA) with both the THP and library fusion
plasmids according to standard procedure (Ausubel, et al., 1992, Short protocols in
molecular biology, fourth edition, Greene and Wiley-interscience, NY, which is
incorporated herein by reference in its entirety). In vivo binding of DNA-BD/THP with
AD/library proteins results in transcription of specific yeast plasmid reporter genes (i.e.,
lacZ, HIS3, ADE2, LEU2). Yeast cells are plated on nutrient-deficient media to screen for
expression of reporter genes. Colonies are dually assayed for β-galactosidase activity upon
growth in Xgal (5-bromo-4-chloro-3-indolyl-β-D-galactoside) supplemented media (a filter
assay for β-galactosidase activity is described in Breeden, et al., Cold Spring Harb. Symp.
Quant. Biol., 1985, 50, 643, which is incorporated herein by reference in its entirety).
Positive AD-library plasmids are rescued from transformants and reintroduced into the
original yeast strain as well as other strains containing unrelated DNA-BD fusion proteins
to confirm specific THP/library protein interactions. Insert DNA is sequenced to verify
the presence of an open reading frame fused to GAL4 AD and to determine the identity of
the THP-binding protein.



Example 7: Mobility Shift DNA-Binding Assay Using Gel Electrophoresis

A gel electrophoresis mobility shift assay can rapidly detect specific
protein-DNA interactions. Protocols are widely available in such manuals as Sambrook et
al. 1989, Molecular cloning: a laboratory manual, second edition, Cold Spring Harbor
Press, Cold Spring Harbor, NY and Ausubel, F. M. et al. 1992, Short protocols in
molecular biology, fourth edition, Greene and Wiley-interscience, NY, each of which is
incorporated herein by reference in its entirety.

Probe DNA (<300 bp) is obtained from synthetic oligonucleotides,
restriction endonuclease fragments, or PCR fragments and end-labeled with 32P. An
aliquot of purified THP (ca. 15 µg) or crude THP extract (ca. 15 ng) is incubated at
constant temperature (in the range 22-37°C) for at least 30 minutes in 10-15 µl of buffer
(i.e., TAE or TBE, pH 8.0-8.5) containing radiolabeled probe DNA, nonspecific carrier
DNA (ca. 1 µg), BSA (300 µg/ml), and 10% (v/v) glycerol. The reaction mixture is then
loaded onto a polyacrylamide gel and run at 30-35 mA until good separation of free probe
DNA from protein-DNA complexes occurs. The gel is then dried and bands
corresponding to free DNA and protein-DNA complexes are detected by autoradiography.

Example 8: Assay to Identify Compounds that Modulate THP Activity

THP is capable of auto-poly[ADP]-ribosylation through the activity of the
PARP domain. To screen for this THP activity, THP expressed in baculovirus is used (see
Example 5). Four µg of THP is incubated in 100 µl of assay buffer (50 mM Tris, pH 8.0,
4 mM MgCl2, 0.2 mM DTT) together with 1.3 µM 32P-labeled NAD+, at 25°C for 30
minutes. Incorporation of 32P-labeled NAD+ is measured by TCA precipitation, by
addition of 20% TCA, and collection by filtration of insoluble material onto glass filters.
The filters are counted by scintillation counting in a fluid beta-counter, and the resulting
counts are used as a measure for THP PARP activity. This same assay can also be used,
for example, to screen large libraries of chemical compounds for small molecules that are
THP PARP inhibitors.
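
When the filter counts from this assay are used to screen compounds, each compound's counts are typically compared with an uninhibited control and a background blank. The sketch below shows one way this arithmetic could be done; it is not part of the described method. The function names, the no-enzyme background well, and all count values are hypothetical, and the NAD+ amount assumes the reaction conditions are read as 1.3 µM NAD+ in a 100 µl reaction.

```python
# Illustrative arithmetic for a PARP inhibitor screen; all names and counts
# below are hypothetical examples, not data from the assay described above.

def nad_pmol(conc_um: float, volume_ul: float) -> float:
    """Amount of NAD+ in the reaction: micromolar x microliters = picomoles."""
    return conc_um * volume_ul


def percent_inhibition(sample_cpm: float, control_cpm: float,
                       background_cpm: float) -> float:
    """Percent loss of background-corrected signal relative to the
    uninhibited control (0 = no inhibition, 100 = full inhibition)."""
    window = control_cpm - background_cpm
    if window <= 0:
        raise ValueError("control counts must exceed background counts")
    return 100.0 * (1.0 - (sample_cpm - background_cpm) / window)


# Assumed reading of the reaction conditions: 1.3 uM NAD+ in 100 ul.
print(f"NAD+ per reaction: {nad_pmol(1.3, 100):.0f} pmol")

# Hypothetical counts per minute for one compound well.
control, background, sample = 12000.0, 400.0, 3300.0
print(f"inhibition: {percent_inhibition(sample, control, background):.1f} %")
```

A compound would then be flagged as a candidate inhibitor when its percent inhibition exceeds a threshold chosen for the screen.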

Some of the preferred embodiments of the invention described above are
outlined below and include, but are not limited to, the following embodiments. As those
skilled in the art will appreciate, numerous changes and modifications may be made to the
preferred embodiments of the invention without departing from the spirit of the invention.
It is intended that all such variations fall within the scope of the invention. The
entire disclosure of each publication cited herein is hereby incorporated by reference.

WHAT IS CLAIMED IS:

1. An isolated nucleic acid molecule comprising a nucleotide sequence selected from
the group consisting of:

    a) SEQ ID NO:3, or a fragment thereof;

    b) SEQ ID NO:4, or a fragment thereof;

    c) a sequence homologous to SEQ ID NO:3 or SEQ ID NO:4, or a fragment
    thereof;

    d) a sequence that encodes a polypeptide comprising SEQ ID NO:5, or a
    fragment thereof; and

    e) a sequence that encodes a polypeptide comprising an amino acid sequence
    homologous to SEQ ID NO:5, or a fragment thereof;

said nucleic acid molecule encoding at least a portion of a tankyrase homolog
protein.

2. The nucleic acid molecule of claim 1 wherein said nucleic acid molecule is DNA.

3. The nucleic acid molecule of claim 1 wherein said nucleic acid molecule is RNA.

4. The nucleic acid molecule of claim 2 wherein said nucleotide sequence comprises
SEQ ID NO:4.

5. An isolated nucleic acid molecule comprising a nucleotide sequence
complementary to at least a portion of SEQ ID NO:3 or SEQ ID NO:4, a complement of
said nucleic acid molecule encoding at least a portion of a tankyrase homolog protein.

6. The nucleic acid molecule of claim 5 wherein said molecule is an antisense
oligonucleotide directed to SEQ ID NO:3 or SEQ ID NO:4.

7. The nucleic acid molecule of claim 6 wherein said oligonucleotide is directed to a
regulatory region of SEQ ID NO:3 or SEQ ID NO:4.

8. An expression vector comprising a nucleic acid molecule of claim 1 or 5.

9. An expression vector of claim 8, wherein said nucleic acid molecule comprises
SEQ ID NO:4.

10. The vector of claim 8 wherein said vector is a plasmid.

11. The vector of claim 8 wherein said vector is a viral particle.

12. The vector of claim 11 wherein said vector is selected from the group consisting of
adenoviruses, parvoviruses, herpesviruses, poxviruses, adeno-associated viruses, Semliki
Forest viruses, vaccinia viruses, and retroviruses.

13. The vector of claim 8 wherein said nucleic acid molecule is operably connected to
a promoter selected from the group consisting of simian virus 40, mouse mammary tumor
virus, long terminal repeat of human immunodeficiency virus, Moloney virus,
cytomegalovirus immediate early promoter, Epstein Barr virus, Rous sarcoma virus, human
actin, human myosin, human hemoglobin, human muscle creatine, and human
metallothionein.

14. A host cell transformed with a vector of claim 8. 

15. The transformed host cell of claim 14 wherein said cell is a bacterial cell. 

16. The transformed host cell of claim 15 wherein said bacterial cell is E. coli.

17. The transformed host cell of claim 14 wherein said cell is a yeast.

18. The transformed host cell of claim 17 wherein said yeast is S. cerevisiae.

19. The transformed host cell of claim 14 wherein said cell is an insect cell.

20. The transformed host cell of claim 19 wherein said insect cell is S. frugiperda.

21. The transformed host cell of claim 14 wherein said cell is a mammalian cell.

22. The transformed host cell of claim 21 wherein said mammalian cell is selected from the
group consisting of Chinese hamster ovary cells, HeLa cells, African green monkey kidney
cells, human 293 cells, and murine 3T3 fibroblasts.

23. A method of producing a polypeptide comprising SEQ ID NO:5, or a homolog or
fragment thereof comprising the steps of:

    a) introducing a recombinant expression vector of claim 7 into a compatible
    host cell;

    b) growing said host cell under conditions for expression of said polypeptide;
    and

    c) recovering said polypeptide.

24. The method of claim 23 wherein said host cell is lysed and said polypeptide is
recovered from the lysate of said host cell.

25. The method of claim 23 wherein said polypeptide is recovered by purifying the
culture medium without lysing said host cell.

26. A composition comprising a nucleic acid molecule of claim 1 or 5 and an
acceptable carrier or diluent.

27. A composition comprising a recombinant expression vector of claim 8 and an
acceptable carrier or diluent.

28. An isolated polypeptide encoded by a nucleic acid molecule of claim 1.

29. The polypeptide of claim 28 wherein said polypeptide comprises SEQ ID NO:5.

30. The polypeptide of claim 28 wherein said polypeptide comprises an amino acid
sequence homologous to SEQ ID NO:5.

31. The polypeptide of claim 30 wherein said sequence homologous to SEQ ID NO:5
comprises at least one conservative amino acid substitution compared to SEQ ID NO:5.

32. The polypeptide of claim 28 wherein said polypeptide comprises a fragment of
SEQ ID NO:5.

33. A composition comprising a polypeptide of claim 28 and an acceptable carrier or
diluent.

34. An isolated antibody which binds to an epitope on a polypeptide of claim 28.

35. The antibody of claim 34 wherein said antibody is a monoclonal antibody.

36. A composition comprising an antibody of claim 34 and an acceptable carrier or
diluent.

37. A kit comprising an antibody which binds to a polypeptide of claim 28 and a
negative control antibody.

38. The kit of claim 37 further comprising an additional kit component.

39. The kit of claim 38 wherein said additional kit component comprises instructions.

40. A kit comprising a nucleic acid molecule of claim 1 or 5.

41. The kit of claim 40 further comprising an additional kit component.

42. The kit of claim 41 wherein said additional kit component comprises instructions.

43. A kit comprising a polypeptide of claim 28.

44. The kit of claim 43 further comprising an additional kit component.

45. The kit of claim 44 wherein said additional kit component comprises instructions.

46. A method of inducing an immune response in a mammal against a polypeptide of
claim 28 comprising administering to said mammal an amount of said polypeptide
sufficient to induce said immune response.

47. A method for identifying a compound which binds tankyrase homolog protein
(THP) comprising the steps of:

    a) contacting THP with a compound; and

    b) determining whether said compound binds THP.

48. The method of claim 47 wherein binding of said compound to THP is
determined by a protein binding assay.



49. The method of claim 48 wherein said protein binding assay is selected from
the group consisting of a gel-shift assay, Western blot, radiolabeled competition assay,
phage-based expression cloning, co-fractionation by chromatography, co-precipitation,
cross linking, interaction trap/two-hybrid analysis, southwestern analysis, and ELISA.

50. A method for identifying a compound which binds a nucleic acid molecule
encoding tankyrase homolog protein (THP) comprising the steps of:

    a) contacting said nucleic acid molecule encoding THP with a
    compound; and

    b) determining whether said compound binds said nucleic acid
    molecule.

51. The method of claim 50 wherein binding is determined by a gel-shift assay.

52. A method for identifying a compound which modulates the activity of
tankyrase homolog protein (THP) comprising the steps of:

    a) contacting THP with a compound; and

    b) determining whether THP activity has been modulated.

53. The method of claim 52 wherein said activity is ADP-ribosylation.

54. The method of claim 52 wherein said activity is TRF1 binding.

55. A compound identified by the method of claim 47.

56. A compound identified by the method of claim 50.

57. A compound identified by the method of claim 52.


ABSTRACT OF THE DISCLOSURE 
The present invention provides a human polypeptide homolog of human 
tankyrase protein (THP) and polynucleotides which identify and encode THP. In addition, 
the invention provides expression vectors, host cells and methods for its production. The 
invention also provides methods for the identification of THP agonists/antagonists, useful 
for the treatment of human diseases, such as human cancer and age related diseases.

☒ Utility Patent □ Design Patent
□ was filed on as Serial No.
below and, insofar as the subject matter of each of the claims of this application is not
disclosed in the prior United States application in the manner provided by the first paragraph
of 35 U.S.C. § 112, I acknowledge the duty to disclose to the U.S. Patent and Trademark
Office all information known to be material to patentability as defined in 37 CFR § 1.56
PCT international filing date of this application:

Serial Number    Date Filed    Patented/Pending/Abandoned
application or any patent issued thereon.

SEQUENCE LISTING

<110> Berthelsen, Jens
      Toma, Salvatore
      Isacchi, Antonella

<120> Tankyrase Homolog Protein (THP), Nucleic Acids, And Methods Related To The Same
<130> PUJ-0043

<140>
<141>

<160> 7

<170> PatentIn Ver. 2.0

<210> 1
<211> 20
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: PCR Primers

<400> 1
gagtatttgt tacaacacgg 20

<210> 2
<211> 18
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: PCR Primers

<400> 2
aatctccttc agctcctt 18

<210> 3
<211> 4512
<212> DNA
<213> Homo sapiens

<220>
<223>
<400> 3

gaattccgcg gcggccagga tcatgtcggg tcgccgctgc gccggcgggg gagcggcctg 60 
cgcgagcgcc gcggccgagg ccgtggagcc ggccgcccga gagctgttcg aggcgtgccg 120 
caacggggac gtggaacgag tcaagaggct ggtgacgcct gagaaggtga acagccgcga 180

cacggcgggc aggaaatcca ccccgctgca cctcgccgca ggttttgggc ggaaagacgt 240 
agttgaatat ttgcttcaga atggtgcaaa tgtccaagca cgtgatgatg ggggccttat 300 
tcctcttcat aatgcatgct cttttggtca tgctgaagta gtcaatctcc ttttgcgaca 360 
tggtgcagac cccaatgctc gagataattg gaattatact cctctccatg aagctgcaat 420 
taaaggaaag attgatgttt gcattgtgct gttacagcat ggagctgagc caaccatccg 480 
aaatacagat ggaaggacag cattggattt agcagatcca tctgccaaag cagtgcttac 540
tggtgaatat aagaaagatg aactcttaga aagtgccagg agtggcaatg aagaaaaaat 600
gatggctcta ctcacaccat taaatgtcaa ctgccacgca agtgatggca gaaagtcaac 660
tccattacat ttggcagcag gatataacag agtaaagatt gtacagctgt tactgcaaca 720
tggagctgat gtccatgcta aagataaagg tgatctggta ccattacaca atgcctgttc 780
ttatggtcat tatgaagtaa ctgaactttt ggtcaagcat ggtgcctgtg taaatgcaat 840
ggacttgtgg caattcactc ctcttcatga ggcagcttct aagaacaggg ttgaagtatg 900
ttctcttctc ttaagttatg gtgcagaccc aacactgctc aattgtcaca ataaaagtgc 960 
tatagacttg gctcccacac cacagttaaa agaaagatta gcatatgaat ttaaaggcca 1020 
ctcgttgctg caagctgcac gagaagctga tgttactcga atcaaaaaac atctctctct 1080 
ggaaatggtg aatttcaagc atcctcaaac acatgaaaca gcantgcatt gtgctgctgc 1140
atctccatat cccaaaagaa agcaaatatg tgaactgttg ctaagaaaag gagcaamcat 1200
caatgaaaag actaaagaat tcttgactcc tctgcacgtg gcatctgaga aagctcataa 1260
tgatrttgtt gaagtagtgg tgaaacatga agcaaaggtt aatgctctgg ataatcttgg 1320 



tcagacttct ctacacagag ctgcatattg tggtcatcta caaacctgcc gcctactcct 1380
gagctatggg tgtgatccta acattatatc ccttcagggc tttactgctt tacagatggg 1440
aaatgaaaat gtacagcaac tcctccaaga gggtatctca ttaggtaatt cagaggcaga 1500
cagacaattg ctggaagctg caaaggctgg agatgtcgaa actgtaaaaa aactgtgtac 1560
tgttcagagt gtcaactgca gagacattga agggcgtcag tctacaccac ttcattttgc 1620
agctgggtat aacagagtgt ccgtggtgga atatctgcta cagcatggag ctgatgtgca 1680
tgctaaagat aaagrrgscc ttgtaccttt gcacaatgca tgttcttatg gacattatga 1740
agttgcagaa cttcttgtta aacatggagc agtagttaat gtagctgatt tatggaaatt 1800
tacaccttta catgaagcag cagcaaaagg aaaatatgaa atttgcaaac ttctgctcca 1860
gcatggtgca gaccctacaa aaaaaaacag ggatggaaat actcctttgg atcttgttaa 1920
agatggagat acagatattc aagatctgct taggggagat gcagctttgc tagatgctgc 1980
caagaagagt tgtttagcca gagtgaagaa gttgtcttct cctgataatg taaattgccg 2040
cgatacccaa ggcagacatt caacaccttt acatttagca gctggttata ataatttaga 2100
agttgcagag tatttgttac aacacggagc tgatgtgaat gcccaagaca aaggaggact 2160
tattccttta cataatgcag catcttacgg gcatgtagat gtagcagctc tactaataaa 2220
gtataatgca tgtgtcaatg ccacggacaa atgggctttc acacctttgc acgaagcagc 2280
ccaaaaggga cgaacacagc tttgtgcttt gktgctagcc catggagctg acccgactct 2340 
taaaaatcag gaaggacaaa cacctttaga tttagtttca gcggatgatg tcagcgctct 2400 
tctgacagca gccatgcccc catctgctct gccctcttgt tacaagcctc aagtgctcaa 2460 
tggtgtgaga agcccaggag ccactgcaga tgctctctct tcaggtccat ctagcccatc 2520 
aagcctttct gcagccagca gtcttgacaa cttatctggg agtttttcag aactgtcttc 2580 
agtagttagt tcaagtggaa cagagggtgc ttccagtttg gagaaaaagg aggttccagg 2640 
agtagatttt agcataactc aattcgtaag gnatcttgga cttgagcacc taatggatat 2700 
atttgagaga gaacagatca ctttggatgt attagttgag atggggcaca aggagctgaa 2760
ggagattgga atcaatgctt atggacatag gcacaaacta attaaaggag tcgagagac^^O
tatctccgga caacaaggtc ttaacccata tttaactttg aacacctctg gtagtggaac 2880 
aattcttata gatctgtctc ctgatgataa agagtttcag tctgtggagg aagagatgca 2940 
aagtacagtt cgagagcaca gagatggagg tcatgcaggt ggaatcttca acagatacaa 3000 
tattacaagmttca^fg«p»^ gaaacfatgg gaaagataca ctcaccggag 3060 
aaaagaagtt tctgaagaaa accacaacca tgccaatgaa cgaatgctat ttcatgggtc 3120
tccttttgtg aatgcaatta tccacaaagg ctttgatgaa aggcatgcgt acataggtgg 3180 
tatgtttgga gctggcattt attttgctga aaactcttcc aaaagcaatc aatatgtata 3240 
tggaattgga ggaggtactg ggtgtccagt tcacaaagac agatcttgtt acatttgcca 3300 

caggcagctg ctcttttgcc gggtaacctt gggaaagtct ttcctgcagt tcagtgcaat 3360
gaaaatggca cattctcctc caggtcatca ctcagtcact ggtaggccca gtgtaaatgg 3420
cctagcatta gctgaatatg ttatttacag aggagaacag gcttatcctg agtatttaat 3480

tacttaccag attatgaggc ctgaaggtat ggtcgatgga.taaatagt^ttttaag^ 3540 

ctaattccac tgaacctaaa atc&fca^
ctgaaaaaaa atcatcttgc ccaqaggcct gtggc^aaag gataaaaatg^gaacgaagr 3660
ttaacattet gacttgataa agctttaata atgtacagtg ttttcta^t atttcctgtt 3720
ttttcagcac tttaacagat gccaiteeag gttaaactgg gttgtctgta ctaaattata 3780
aacagagtta acttgaacct tttatatgtt atgcattgat tctaacaaac tgtaatgccc 3840 
tcaacagaac taattttact aatacaatac tgtgttcttt aaaacacagc atttacactg 3900 
aatacaattt catttgtaaa actgtaaata agagcttttg tactagccca gtatttattt 3960 
acattgcttt gtaatataaa tctgttttag aactgcagcg gtttacaaaa ttttttcata 4020 
tgtattgttc atctatactt gcatcttaca tcgtcatgat tgagtgatct ttacatttga 4080 
ttccagaggc tatgttcagt tgttagttgg gaaagattga gttatcagat ttaatttgcc 4140
gatgggagcc tttatctgtc attagaaatc tttctcattt aagaacttat gaatatgctg 4200
aagatttaat ttgtgatacc tttgtatgta tgagacacat tccaaagaac tctaactatg 4260 
ataggtcctg attactaaag aagcttcttt actggcctca atttctagct ttcatgttgg 4320 
aaaattttct gcagtccttc tgtgaaaatt agagcaaagt gctcctgttt tttagagaaa 4380 
ctaaatcttg ctgttgaaca attattgtgt tcttttcatg gaacataagt aggatgttac 4440 
atttccaggg tgggaagggt aatcctaaat catttcccaa tctattctaa ttaccttaaa 4500
tctaaagggg aa 4512

<210> 4
<211> 3498
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> (1)..(3498)
<223> CDS

<400> 4

atg tcg ggt cgc cgc tgc gcc ggc ggg gga gcg gcc tgc gcg agc gcc  48
Met Ser Gly Arg Arg Cys Ala Gly Gly Gly Ala Ala Cys Ala Ser Ala
1               5                   10                  15

gcg gcc gag gcc gtg gag ccg gcc gcc cga gag ctg ttc gag gcg tgc  96
Ala Ala Glu Ala Val Glu Pro Ala Ala Arg Glu Leu Phe Glu Ala Cys
            20                  25                  30

cgc aac ggg gac gtg gaa cga gtc aag agg ctg gtg acg cct gag aag  144
Arg Asn Gly Asp Val Glu Arg Val Lys Arg Leu Val Thr Pro Glu Lys
        35                  40                  45

gtg aac agc cgc gac acg gcg ggc agg aaa tcc acc ccg ctg cac ctc  192
Val Asn Ser Arg Asp Thr Ala Gly Arg Lys Ser Thr Pro Leu His Leu
    50                  55                  60

gcc gca ggt ttt ggg cgg aaa gac gta gtt gaa tat ttg ctt cag aat  240
Ala Ala Gly Phe Gly Arg Lys Asp Val Val Glu Tyr Leu Leu Gln Asn
65                  70                  75                  80

ggt gca aat gtc caa gca cgt gat gat ggg ggc ctt att cct ctt cat  288
Gly Ala Asn Val Gln Ala Arg Asp Asp Gly Gly Leu Ile Pro Leu His
                85                  90                  95

aat gca tgc tct ttt ggt cat gct gaa gta gtc aat ctc ctt ttg cga  336
Asn Ala Cys Ser Phe Gly His Ala Glu Val Val Asn Leu Leu Leu Arg
            100                 105                 110

cat ggt gca gac ccc aat gct cga gat aat tgg aat tat act cct ctc  384
His Gly Ala Asp Pro Asn Ala Arg Asp Asn Trp Asn Tyr Thr Pro Leu
        115                 120                 125

cat gaa gct gca att aaa gga aag att gat gtt tgc att gtg ctg tta  432
His Glu Ala Ala Ile Lys Gly Lys Ile Asp Val Cys Ile Val Leu Leu
    130                 135                 140

cag cat gga gct gag cca acc atc cga aat aca gat gga agg aca gca  480
Gln His Gly Ala Glu Pro Thr Ile Arg Asn Thr Asp Gly Arg Thr Ala
145                 150                 155                 160

ttg gat tta gca gat cca tct gcc aaa gca gtg ctt act ggt gaa tat  528
Leu Asp Leu Ala Asp Pro Ser Ala Lys Ala Val Leu Thr Gly Glu Tyr
                165                 170                 175

aag aaa gat gaa ctc tta gaa agt gcc agg agt ggc aat gaa gaa aaa  576
Lys Lys Asp Glu Leu Leu Glu Ser Ala Arg Ser Gly Asn Glu Glu Lys
            180                 185                 190

atg atg gct cta ctc aca cca tta aat gtc aac tgc cac gca agt gat  624
Met Met Ala Leu Leu Thr Pro Leu Asn Val Asn Cys His Ala Ser Asp
        195                 200                 205

ggc aga aag tca act cca tta cat ttg gca gca gga tat aac aga gta  672
Gly Arg Lys Ser Thr Pro Leu His Leu Ala Ala Gly Tyr Asn Arg Val
    210                 215                 220

aag att gta cag ctg tta ctg caa cat gga gct gat gtc cat gct aaa  720
Lys Ile Val Gln Leu Leu Leu Gln His Gly Ala Asp Val His Ala Lys
225                 230                 235                 240

gat aaa ggt gat ctg gta cca tta cac aat gcc tgt tct tat ggt cat  768
Asp Lys Gly Asp Leu Val Pro Leu His Asn Ala Cys Ser Tyr Gly His
                245                 250                 255

tat gaa gta act gaa ctt ttg gtc aag cat ggt gcc tgt gta aat gca  816
Tyr Glu Val Thr Glu Leu Leu Val Lys His Gly Ala Cys Val Asn Ala
            260                 265                 270

atg gac ttg tgg caa ttc act cct ctt cat gag gca gct tct aag aac  864
Met Asp Leu Trp Gln Phe Thr Pro Leu His Glu Ala Ala Ser Lys Asn
        275                 280                 285

agg gtt gaa gta tgt tct ctt ctc tta agt tat ggt gca gac cca aca  912
Arg Val Glu Val Cys Ser Leu Leu Leu Ser Tyr Gly Ala Asp Pro Thr
    290                 295                 300

ctg ctc aat tgt cac aat aaa agt gct ata gac ttg gct ccc aca cca  960
Leu Leu Asn Cys His Asn Lys Ser Ala Ile Asp Leu Ala Pro Thr Pro
305                 310                 315                 320

cag tta aaa gaa aga tta gca tat gaa ttt aaa ggc cac tcg ttg ctg  1008
Gln Leu Lys Glu Arg Leu Ala Tyr Glu Phe Lys Gly His Ser Leu Leu
                325                 330                 335

caa gct gca cga gaa gct gat gtt act cga atc aaa aaa cat ctc tct  1056
Gln Ala Ala Arg Glu Ala Asp Val Thr Arg Ile Lys Lys His Leu Ser
            340                 345                 350

ctg gaa atg gtg aat ttc aag cat cct caa aca cat gaa aca gca ntg  1104
Leu Glu Met Val Asn Phe Lys His Pro Gln Thr His Glu Thr Ala Xaa
        355                 360                 365

cat tgt gct gct gca tct cca tat ccc aaa aga aag caa ata tgt gaa  1152
His Cys Ala Ala Ala Ser Pro Tyr Pro Lys Arg Lys Gln Ile Cys Glu
    370                 375                 380

ctg ttg cta aga aaa gga gca amc atc aat gaa aag act aaa gaa ttc  1200
Leu Leu Leu Arg Lys Gly Ala Xaa Ile Asn Glu Lys Thr Lys Glu Phe
385                 390                 395                 400

ttg act cct ctg cac gtg gca tct gag aaa gct cat aat gat rtt gtt  1248
Leu Thr Pro Leu His Val Ala Ser Glu Lys Ala His Asn Asp Xaa Val
                405                 410                 415

gaa gta gtg gtg aaa cat gaa gca aag gtt aat gct ctg gat aat ctt  1296
Glu Val Val Val Lys His Glu Ala Lys Val Asn Ala Leu Asp Asn Leu
            420                 425                 430

ggt cag act tct cta cac aga gct gca tat tgt ggt cat cta caa acc  1344
Gly Gln Thr Ser Leu His Arg Ala Ala Tyr Cys Gly His Leu Gln Thr
        435                 440                 445

tgc cgc cta ctc ctg agc tat ggg tgt gat cct aac att ata tcc ctt  1392
Cys Arg Leu Leu Leu Ser Tyr Gly Cys Asp Pro Asn Ile Ile Ser Leu
    450                 455                 460

cag ggc ttt act get tta cag atg gga aat gaa aat gta cag caa etc 1440 
Gin Gly Phe Thr Ala Leu Gin Met Gly Asn Glu Asn Val Gin Gin Leu 
465 470 475 480 

etc caa gag ggt ate tea tta ggt aat tea gag gca gac aga caa ttg 1488 
Leu Gin Glu Gly He Ser Leu Gly Asn Ser Glu Ala Asp Arg Gin Leu 
485 490 495 

ctg gaa get gca aag get gga gat gtc gaa act gta aaa »zz ctg tgt 1 536 
Leu Glu Ala Ala Lys Ala Gly Asp Val Glu Thr Val Lys Lys Leu Cys 
500 505 510 

act gtt cag agt gtc aac tgc aga gac att gaa ggg cgt cag tct aca 1584 
„ % Thr Val Gin Ser Val Asn Cys Arg Asp He Glu Gly Arg Gin Ser Thr 
Jj 515 520 525 

U 

1, 1 cca ctt cat ttt gca get ggg tat aac aga gtg tec gtg gtg gaa tat 1632 

C 3 Pro Leu His Phe Ala Ala Gly Tyr Asn Arg Val Ser Val Val Glu Tyr 
<5 530 535 540 

C3 

; * ctg eta cag cat gga get gat gtg cat;gct aaa gat aaa grr gsc;ctt 1680 

r i Leu Leu Gin His Gly Ala Asp Val His Ala Lys AspfLys XaaXaaLeu 

: <J 545 550 ~ 555: 560 

d. 

*.3 gta cct ttg cac aat gca tgt tct tat gga cat tat^gaa gtt gca gaa 1728;, 

^3 Val Pro Leu His Asn Ala Cys Ser Tyr Gly His Tyr Glu Val AJa Glu 
- 565 5703 575" 

ctt ctt gtt aaa cat gga gca gta gtt aat gta get gat tta tgg aaa; 1 776 
Leu Leu Val Lys His Gly Ala Val Val Asn Val Ala Asp Leu Trp Lys 
580 585 590 

ttt aca cct tta cat gaa gca gca gca aaa gga aaa tat gaa att tgc 1824 
Phe Thr Pro Leu His Glu Ala Ala Ala Lys Gly Lys Tyr Glu Ile Cys
595 600 605 

aaa ctt ctg ctc cag cat ggt gca gac cct aca aaa aaa aac agg gat 1872
Lys Leu Leu Leu Gln His Gly Ala Asp Pro Thr Lys Lys Asn Arg Asp
610 615 620 

gga aat act cct ttg gat ctt gtt aaa gat gga gat aca gat att caa 1920
Gly Asn Thr Pro Leu Asp Leu Val Lys Asp Gly Asp Thr Asp Ile Gln
625 630 635 640

gat ctg ctt agg gga gat gca gct ttg cta gat gct gcc aag aag agt 1968
Asp Leu Leu Arg Gly Asp Ala Ala Leu Leu Asp Ala Ala Lys Lys Ser 
645 650 655 

tgt tta gcc aga gtg aag aag ttg tct tct cct gat aat gta aat tgc 2016
Cys Leu Ala Arg Val Lys Lys Leu Ser Ser Pro Asp Asn Val Asn Cys 
660 665 670 

cgc gat acc caa ggc aga cat tca aca cct tta cat tta gca gct ggt 2064
Arg Asp Thr Gln Gly Arg His Ser Thr Pro Leu His Leu Ala Ala Gly
675 680 685

tat aat aat tta gaa gtt gca gag tat ttg tta caa cac gga gct gat 2112
Tyr Asn Asn Leu Glu Val Ala Glu Tyr Leu Leu Gln His Gly Ala Asp
690 695 700

gtg aat gcc caa gac aaa gga gga ctt att cct tta cat aat gca gca 2160
Val Asn Ala Gln Asp Lys Gly Gly Leu Ile Pro Leu His Asn Ala Ala
705 710 715 720

tct tac ggg cat gta gat gta gca gct cta cta ata aag tat aat gca 2208
Ser Tyr Gly His Val Asp Val Ala Ala Leu Leu Ile Lys Tyr Asn Ala
725 730 735

tgt gtc aat gcc acg gac aaa tgg gct ttc aca cct ttg cac gaa gca 2256
Cys Val Asn Ala Thr Asp Lys Trp Ala Phe Thr Pro Leu His Glu Ala
740 745 750

gcc caa aag gga cga aca cag ctt tgt gct ttg ktg cta gcc cat gga 2304
Ala Gln Lys Gly Arg Thr Gln Leu Cys Ala Leu Xaa Leu Ala His Gly
755 760 765 

gct gac ccg act ctt aaa aat cag gaa gga caa aca cct tta gat tta 2352
Ala Asp Pro Thr Leu Lys Asn Gln Glu Gly Gln Thr Pro Leu Asp Leu
770 775 780 

gtt tca gcg gat gat gtc agc gct ctt ctg aca gca gcc atg ccc cca 2400
Val Ser Ala Asp Asp Val Ser Ala Leu Leu Thr Ala Ala Met Pro Pro 
785 790 795 800 

tct gct ctg ccc tct tgt tac aag cct caa gtg ctc aat ggt gtg aga 2448
Ser Ala Leu Pro Ser Cys Tyr Lys Pro Gln Val Leu Asn Gly Val Arg
805 810 815

agc cca gga gcc act gca gat gct ctc tct tca ggt cca tct agc cca 2496
Ser Pro Gly Ala Thr Ala Asp Ala Leu Ser Ser Gly Pro Ser Ser Pro 
820 825 830 

tca agc ctt tct gca gcc agc agt ctt gac aac tta tct ggg agt ttt 2544
Ser Ser Leu Ser Ala Ala Ser Ser Leu Asp Asn Leu Ser Gly Ser Phe 
835 840 845 

tca gaa ctg tct tca gta gtt agt tca agt gga aca gag ggt gct tcc 2592
Ser Glu Leu Ser Ser Val Val Ser Ser Ser Gly Thr Glu Gly Ala Ser 
850 855 860 

agt ttg gag aaa a<t& gag gtt cca gga gta gat ttt age ata act caa 2640 
Ser Leu Glu Lys Lys Glu Val Pro Gly Val Asp Phe Ser Ile Thr Gln
865 870 875 880 

ttc gta agg nat ctt gga ctt gag cac cta atg gat ata ttt gag aga 2688
Phe Val Arg Xaa Leu Gly Leu Glu His Leu Met Asp Ile Phe Glu Arg
885 890 895

gaa cag atc act ttg gat gta tta gtt gag atg ggg cac aag gag ctg 2736
Glu Gln Ile Thr Leu Asp Val Leu Val Glu Met Gly His Lys Glu Leu
900 905 910

aag gag att gga atc aat gct tat gga cat agg cac aaa cta att aaa 2784
Lys Glu Ile Gly Ile Asn Ala Tyr Gly His Arg His Lys Leu Ile Lys
915 920 925

gga gtc gag aga ctt atc tcc gga caa caa ggt ctt aac cca tat tta 2832
Gly Val Glu Arg Leu Ile Ser Gly Gln Gln Gly Leu Asn Pro Tyr Leu
930 935 940

act ttg aac acc tct ggt agt gga aca att ctt ata gat ctg tct cct 2880 
Thr Leu Asn Thr Ser Gly Ser Gly Thr Ile Leu Ile Asp Leu Ser Pro
945 950 955 960 

gat gat aaa gag ttt cag tct gtg gag gaa gag atg caa agt aca gtt 2928 
Asp Asp Lys Glu Phe Gln Ser Val Glu Glu Glu Met Gln Ser Thr Val
965 970 975 

cga gag cac aga gat gga ggt cat gca ggt gga atc ttc aac aga tac 2976
Arg Glu His Arg Asp Gly Gly His Ala Gly Gly Ile Phe Asn Arg Tyr
980 985 990

aat att ctc aag att cag aag gtt tgt aat aag aaa cta tgg gaa aga 3024
Asn Ile Leu Lys Ile Gln Lys Val Cys Asn Lys Lys Leu Trp Glu Arg
995 1000 1005

tac act cac cgg aga aaa gaa gtt tct gaa gaa aac cac aac cat gcc 3072
Tyr Thr His Arg Arg Lys Glu Val Ser Glu Glu Asn His Asn His Ala
1010 1015 1020 

aat gaa cga atg cta ttt cat ggg tct cct ttt gtg aat gca att atc 3120
Asn Glu Arg Met Leu Phe His Gly Ser Pro Phe Val Asn Ala Ile Ile
1025 1030 1035 1040 

cac aaa ggc ttt gat gaa agg cat gcg tac ata ggt ggt atg ttt gga 3168 
His Lys Gly Phe Asp Glu Arg His Ala Tyr Ile Gly Gly Met Phe Gly
1045 1050 1055 

gct ggc att tat ttt gct gaa aac tct tcc aaa agc aat caa tat gta 3216
Ala Gly Ile Tyr Phe Ala Glu Asn Ser Ser Lys Ser Asn Gln Tyr Val
1060 1065 1070

tat gga att gga gga ggt act ggg tgt cca gtt cac aaa gac aga tct 3264
Tyr Gly Ile Gly Gly Gly Thr Gly Cys Pro Val His Lys Asp Arg Ser
1075 1080 1085

tgt tac att tgc cac agg cag ctg ctc ttt tgc cgg gta acc ttg gga 3312
Cys Tyr Ile Cys His Arg Gln Leu Leu Phe Cys Arg Val Thr Leu Gly
1090 1095 1100

aag tct ttc ctg cag ttc agt gca atg aaa atg gca cat tct cct cca 3360
Lys Ser Phe Leu Gln Phe Ser Ala Met Lys Met Ala His Ser Pro Pro
1105 1110 1115 1120

ggt cat cac tca gtc act ggt agg ccc agt gta aat ggc cta gca tta 3408
Gly His His Ser Val Thr Gly Arg Pro Ser Val Asn Gly Leu Ala Leu 
1125 1130 1135 

gct gaa tat gtt att tac aga gga gaa cag gct tat cct gag tat tta 3456
Ala Glu Tyr Val Ile Tyr Arg Gly Glu Gln Ala Tyr Pro Glu Tyr Leu
1140 1145 1150 

att act tac cag att atg agg cct gaa ggt atg gtc gat gga 3498 
Ile Thr Tyr Gln Ile Met Arg Pro Glu Gly Met Val Asp Gly
1155 1160 1165



<210> 5
<211> 1166
<212> PRT
<213> Homo sapiens
<400> 5

Met Ser Gly Arg Arg Cys Ala Gly Gly Gly Ala Ala Cys Ala Ser Ala
1 5 10 15

Ala Ala Glu Ala Val Glu Pro Ala Ala Arg Glu Leu Phe Glu Ala Cys
20 25 30

Arg Asn Gly Asp Val Glu Arg Val Lys Arg Leu Val Thr Pro Glu Lys 
35 40 45 

Val Asn Ser Arg Asp Thr Ala Gly Arg Lys Ser Thr Pro Leu His Leu 
50 55 60 

Ala Ala Gly Phe Gly Arg Lys Asp Val Val Glu Tyr Leu Leu Gln Asn
65 70 75 80

Gly Ala Asn Val Gln Ala Arg Asp Asp Gly Gly Leu Ile Pro Leu His
85 90 95

Asn Ala Cys Ser Phe Gly His Ala Glu Val Val Asn Leu Leu Leu Arg
100 105 110

His Gly Ala Asp Pro Asn Ala Arg Asp Asn Trp Asn Tyr Thr Pro Leu
115 120 125

His Glu Ala Ala Ile Lys Gly Lys Ile Asp Val Cys Ile Val Leu Leu
130 135 140

Gln His Gly Ala Glu Pro Thr Ile Arg Asn Thr Asp Gly Arg Ala
145 150 155 160

Leu Asp Leu Ala Asp Pro Ser Ala Lys Ala Val Leu Thr Gly Glu Tyr 
165 170 175 

Lys Lys Asp Glu Leu Leu Glu Ser Ala Arg Ser Gly Asn Glu Glu Lys 
180 185 190 

Met Met Ala Leu Leu Thr Pro Leu Asn Val Asn Cys His Ala Ser Asp 
195 200 205 

Gly Arg Lys Ser Thr Pro Leu His Leu Ala Ala Gly Tyr Asn Arg Val 
210 215 220 

Lys Ile Val Gln Leu Leu Leu Gln His Gly Ala Asp Val His Ala Lys
225 230 235 240

Asp Lys Gly Asp Leu Val Pro Leu His Asn Ala Cys Ser Tyr Gly His
245 250 255 

Tyr Glu Val Thr Glu Leu Leu Val Lys His Gly Ala Cys Val Asn Ala 
260 265 270 

Met Asp Leu Trp Gln Phe Thr Pro Leu His Glu Ala Ala Ser Lys Asn
275 280 285 

Arg Val Glu Val Cys Ser Leu Leu Leu Ser Tyr Gly Ala Asp Pro Thr 
290 295 300 

Leu Leu Asn Cys His Asn Lys Ser Ala Ile Asp Leu Ala Pro Thr Pro
305 310 315 320 

Gln Leu Lys Glu Arg Leu Ala Tyr Glu Phe Lys Gly His Ser Leu Leu
325 330 335

Gln Ala Ala Arg Glu Ala Asp Val Thr Arg Ile Lys Lys His Leu Ser
340 345 350

Leu Glu Met Val Asn Phe Lys His Pro Gln Thr His Glu Thr Ala Xaa
355 360 365

His Cys Ala Ala Ala Ser Pro Tyr Pro Lys Arg Lys Gln Ile Cys Glu
370 375 380

Leu Leu Leu Arg Lys Gly Ala Xaa Ile Asn Glu Lys Thr Lys Glu Phe
385 390 395 400

Leu Thr Pro Leu His Val Ala Ser Glu Lys Ala His Asn Asp Xaa Val
405 410 415 

Glu Val Val Val Lys His Glu Ala Lys Val Asn Ala Leu Asp Asn Leu 
420 425 430 

Gly Gln Thr Ser Leu His Arg Ala Ala Tyr Cys Gly His Leu Gln Thr
435 440 445 

Cys Arg Leu Leu Leu Ser Tyr Gly Cys Asp Pro Asn Ile Ile Ser Leu
450 455 460 

Gln Gly Phe Thr Ala Leu Gln Met Gly Asn Glu Asn Val Gln Gln Leu
465 470 475 480

Leu Gln Glu Gly Ile Ser Leu Gly Asn Ser Glu Ala Asp Arg Gln Leu
485 490 495

Leu Glu Ala Ala Lys Ala Gly Asp Val Glu Thr Val Lys Lys Leu Cys 
500 505 510 

Thr Val Gln Ser Val Asn Cys Arg Asp Ile Glu Gly Arg Gln Ser Thr
515 520 525 

Pro Leu His Phe Ala Ala Gly Tyr Asn Arg Val Ser Val Val Glu Tyr 
530 535 540 

Leu Leu Gln His Gly Ala Asp Val His Ala Lys Asp Lys Xaa Xaa Leu
545 550 555 560 

Val Pro Leu His Asn Ala Cys Ser Tyr Gly His Tyr Glu Val Ala Glu 
565 570 575 

Leu Leu Val Lys His Gly Ala Val Val Asn Val Ala Asp Leu Trp Lys
580 585 590

Phe Thr Pro Leu His Glu Ala Ala Ala Lys Gly Lys Tyr Glu Ile Cys
595 600 605

Lys Leu Leu Leu Gln His Gly Ala Asp Pro Thr Lys Lys Asn Arg Asp
610 615 620

Gly Asn Thr Pro Leu Asp Leu Val Lys Asp Gly Asp Thr Asp Ile Gln
625 630 635 640

Asp Leu Leu Arg Gly Asp Ala Ala Leu Leu Asp Ala Ala Lys Lys Ser
645 650 655

Cys Leu Ala Arg Val Lys Lys Leu Ser Ser Pro Asp Asn Val Asn Cys 
660 665 670 

Arg Asp Thr Gln Gly Arg His Ser Thr Pro Leu His Leu Ala Ala Gly
675 680 685

Tyr Asn Asn Leu Glu Val Ala Glu Tyr Leu Leu Gln His Gly Ala Asp
690 695 700

Val Asn Ala Gln Asp Lys Gly Gly Leu Ile Pro Leu His Asn Ala Ala
705 710 715 720 

Ser Tyr Gly His Val Asp Val Ala Ala Leu Leu Ile Lys Tyr Asn Ala
725 730 735

Cys Val Asn Ala Thr Asp Lys Trp Ala Phe Thr Pro Leu His Glu Ala
740 745 750

Ala Gln Lys Gly Arg Thr Gln Leu Cys Ala Leu Xaa Leu Ala His Gly
755 760 765

Ala Asp Pro Thr Leu Lys Asn Gln Glu Gly Gln Thr Pro Leu Asp Leu
770 775 780

Val Ser Ala Asp Asp Val Ser Ala Leu Leu Thr Ala Ala Met Pro Pro
785 790 795 800

Ser Ala Leu Pro Ser Cys Tyr Lys Pro Gln Val Leu Asn Gly Val Arg
805 810 815

Ser Pro Gly Ala Thr Ala Asp Ala Leu Ser Ser Gly Pro Ser Ser Pro
820 825 830

Ser Ser Leu Ser Ala Ala Ser Ser Leu Asp Asn Leu Ser Gly Ser Phe
835 840 845

Ser Glu Leu Ser Ser Val Val Ser Ser Ser Gly Thr Glu Gly Ala Ser
850 855 860

Ser Leu Glu Lys Lys Glu Val Pro Gly Val Asp Phe Ser Ile Thr Gln
865 870 875 880

Phe Val Arg Xaa Leu Gly Leu Glu His Leu Met Asp Ile Phe Glu Arg
885 890 895

Glu Gln Ile Thr Leu Asp Val Leu Val Glu Met Gly His Lys Glu Leu
900 905 910

Lys Glu Ile Gly Ile Asn Ala Tyr Gly His Arg His Lys Leu Ile Lys
915 920 925

Gly Val Glu Arg Leu Ile Ser Gly Gln Gln Gly Leu Asn Pro Tyr Leu
930 935 940

Thr Leu Asn Thr Ser Gly Ser Gly Thr Ile Leu Ile Asp Leu Ser Pro
945 950 955 960

Asp Asp Lys Glu Phe Gln Ser Val Glu Glu Glu Met Gln Ser Thr Val
965 970 975

Arg Glu His Arg Asp Gly Gly His Ala Gly Gly Ile Phe Asn Arg Tyr
980 985 990



Asn Ile Leu Lys Ile Gln Lys Val Cys Asn Lys Lys Leu Trp Glu Arg
995 1000 1005

Tyr Thr His Arg Arg Lys Glu Val Ser Glu Glu Asn His Asn His Ala
1010 1015 1020

Asn Glu Arg Met Leu Phe His Gly Ser Pro Phe Val Asn Ala Ile Ile
1025 1030 1035 1040

His Lys Gly Phe Asp Glu Arg His Ala Tyr Ile Gly Gly Met Phe Gly
1045 1050 1055

Ala Gly Ile Tyr Phe Ala Glu Asn Ser Ser Lys Ser Asn Gln Tyr Val
1060 1065 1070

Tyr Gly Ile Gly Gly Gly Thr Gly Cys Pro Val His Lys Asp Arg Ser
1075 1080 1085

Cys Tyr Ile Cys His Arg Gln Leu Leu Phe Cys Arg Val Thr Leu Gly
1090 1095 1100

Lys Ser Phe Leu Gln Phe Ser Ala Met Lys Met Ala His Ser Pro Pro
1105 1110 1115 1120

Gly His His Ser Val Thr Gly Arg Pro Ser Val Asn Gly Leu Ala Leu
1125 1130 1135

Ala Glu Tyr Val Ile Tyr Arg Gly Glu Gln Ala Tyr Pro Glu Tyr Leu
1140 1145 1150

Ile Thr Tyr Gln Ile Met Arg Pro Glu Gly Met Val Asp Gly
1155 1160 1165 



<210> 6
<211> 19
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: PCR Primers
<400> 6

cccgagagct gttcgaggc 19



<210> 7
<211> 23
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: PCR Primers
<400> 7

caatctttac tctgttatat cct 23



<210> 8
<211> 41
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: PCR Primers
<400> 8

aagcggccgc attatggaaa ggatcatgtc gggtcgccgc t 41

<210> 9
<211> 25
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: PCR Primers
<400> 9

aaggatccac cataccttca ggcct 25



<210> 10
<211> 39
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: PCR Primers
<400> 10

aaaagcttta tggaaaggat catgtcgggt cgccgctgc 39